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AESTRACT 


In  the  absence  of  full  infcrraati.cn  regarding  service 
tiroes  cf  jobs  in  a  system,  analysis  and  simulation  studies 
have  shown  that  performance  can  be  improved  by  using 
whatever  predictive  information  is  available.  Empirical 
studies  of  actual  systems  have  found  that  the  overall 
service  time  distribution  of  jobs  is  highly  skewed  with  a 
very  long  tail.  However,  algorithms  which  use  distribution 
information  can  make  tetter  use  of  distributions  with  lower 
variance.  In  this  thesis,  we  investigate  the  effects  of 
conditioning  service  time  distributions  cn  cha racterist ics 
cf  jobs  known  before  the  jobs  are  processed.  The  resulting 
conditional  distributions  are  compared  tc  determine  which 
characteristics  are  the  best  predictors  of  actual  service 
time.  We  attempt  to  fit  these  dist ribut ic ns  tc  known 
theoretical  distributions.  The  implications  of  the 
empirical  distributions  and  their  associated  mean  residual 
life  and  rank  functions  in  the  choice  of  a  scheduling 
algorithm  are  considered. 
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CHAPTER  I 


Introduction 


In  cur  modern  society,  an  increasing  amount  of  our 
workload  is  being  handled  by  computers,  Eecause  of  the  cos 
of  using  computers,  much  research  has  dwelt  cn  the  problem 
of  producing  efficient  computer  systems.  An  important  area 
in  this  regard  is  the  scheduling  of  the  processing  of  jobs 
that  have  arrived  at  a  system.  This  scheduling  attempts  to 
maximize  the  utilization  of  computer  resources  to  provide 
fast  turnaround  to  the  user,  at  minimum  cost.  However, 
scheduling  disciplines  being  used  in  actual  systems  may  net 
be  optimal.  Before  a  new  scheduling  discipline  can  replace 
an  existing  one  that  performs  reasonably  well,  much 
analytical  and  empirical  study  must  be  centred  cn  the 
assumptions  of  the  discipline  and  its  proposed  capabilities 
Empirical  investigation  into  a  particular  area  of  this 
subject  is  the  intention  of  this  study. 

Eefore  elaborating  on  this  area,  we  present  some 
definitions  relevant  to  our  topic  and  necessary  tc  readers 
interested  in  this  study  but  unfamiliar  with  the 
terminology. 
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Definitions 
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We  present  three  general  definitions  followed  by  (a) 
scheduling  rules,  (b)  associated  functions,  (c) 
distributions,  and  (d)  classi f ica tions  of  distributions, 

(1)  Scheduling  discipline:  a  method  used  to 
determine  the  crder  in  which  jobs  are  processed  in  a  system. 
This  order  can  be  determined  through  a  selection  rule 
(selection  discipline)  which  is  dependent  upon  the  value  cf 
a  function  involving  attributes  of  an  individual  job 
(request) . 

(2)  Optimal  scheduling  discipline:  a  scheduling 
discipline  which  guarantees  tie  most  desirable  value  cf  some 
particular  measure  of  performance. 

(3)  Preemption:  the  interruption  cf  the  processing 
of  a  job  before  its  completion.  The  processing  of  the  job 
may  be  continued  later  from  its  point  of  progress  at  the 
time  of  preemption. 

A.  Scheduling  Disciplines 

(1)  FIFO:  first  in,  first  out.  The  job  arriving 
first  is  processed  first. 

(2)  LIFO:  last  in,  first  out.  The  jot  arriving  last 
is  processed  first. 

(3)  PLIFO:  preemptive  last  in,  first  out.  The  job 


arriving  last  is  given  highest  priority  and  preempts  a  job 
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currently  in  service. 

(4)  RAND:  random  selection.  The  job  tc  be  processed 


is  se 

lected 

at  ra 

ndom 

frem  th 

ose 

wa 

it  ing 

for  service. 

(5)  RR:  r 

cund 

rcb in. 

Job 

s 

wait  in 

g  for  service  are 

queue 

d.  The 

job 

at  the  head 

of  a 

q 

ueue  i 

s  processed  for  a 

quant 

um  of  t 

ime. 

If 

it  does 

not 

CO 

mplete 

within  that 

q  uant 

um,  it 

joins 

the 

end  of 

the 

qu 

eue . 

There  are  a  number 

of  va 

ria ticn 

s  of 

RR: 

(a)  SCBR:  single  quantum  round  icbin.  Fach  job 
receives  a  set  quantum  as  described  above. 

(b)  HQBR :  multiple  quantum  round  rcbin.  A  job 
is  processed  until  completion  or  until  it  receives  one  full 
quantum  during  which  no  jobs  arrive.  If  it  has  not 
completed  in  the  latter  case,  it  joins  the  end  of  the  queue 
as  in  SQRR. 

(c)  CYCRR:  cycle  oriented  round  rcbin.  A  job 
is  processed  for  a  quantum  length  which  is  inversely 
proportional  to  the  number  of  jobs  waiting  in  the  queue  at 
the  instant  the  time  is  allocated. 

(6)  FEn:  feedback  to  n  levels.  A  job  cn  arrival  is 
assigned  to  a  highest  priority  queue.  The  jot  selected  for 
service  is  chosen  from  the  highest  priority  ncn-empty  queue, 
and  is  serviced  for  a  quantum.  If  it  does  net  complete 
within  that  time,  it  joins  the  end  of  the  queue  with  the 


.  This  is  similar  to  RR 


next  lower  priority. 

(7)  PS:  processor  sharing 


t 
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but  with  the  quantum  length  tending  tc  zero.  The  processor 
is  divided  equally  among  all  jobs  in  the  system. 

(8)  PPS:  preemptive  processor  sharing.  This  is  like 
FEn  with  the  quantum  length  tending  to  zero  at  every  gueue 
level . 

{9)  SPT:  shortest  processing  time.  The  job  with  the 
shortest  processing  time  is  serviced  to  completion. 

(10)  SEPT:  shortest  remaining  processing  time.  The 
job  with  the  shortest  remaining  processing  time  has  highest 
priority.  If  a  jcb  arrives  whose  processing  time  is  shorter 
than  the  remaining  processing  time  of  the  job  currently  in 
service,  the  arriving  job  preempts  and  is  serviced. 

(11)  SEPT:  shortest  expected  processing  time.  The 
job  with  the  shortest  expected  processing  time  is  processed 
tc  completion. 


(12)  SEBPT :  shortest  expected  remaining  processing 
time.  This  is  the  same  as  SEPT  except  that  the  expected 
remaining  processing  time  is  used  rather  than  the  actual 
remaining  processing  time. 

(13)  SR:  smallest  rank.  The  job  with  current 
smallest  rank  is  given  highest  priority.  The  rank  of  a  job 
is  calculated  from  the  service  tiie  distribution  of  that 
job,  as  a  function  of  attained  service.  Interval  rank  is 
the  ratio  of  A  to  E  where  A  is  the  expected  investment  of 
processor  time  in  allowing  a  job  up  to  e  more  units  after  it 
has  attained  t  units  of  service  without  completing,  and  B  is 


and  B 
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the  linear  loss  rate  times  the  expected  payoff  in  allowing  a 
job  up  to  e  more  units  of  processor  time.  The  expected 
payoff  is  the  probability  that  the  job  will  complete  within 
the  next  e  units  of  service.  The  rank  cf  a  job  is  defined 
as  the  minimum  interval  rank  for  a  quantum  of  any  length  e 
(Sevcik  1971) .  This  discipline  assumes  that  an  arriving  job 
will  belong  to  a  particular  request  class  with  a  processing 
time  distribution  different  from  those  of  ether  request 
classes.  The  request  class  is  determined  from  the 
characteristics  of  the  arriving  job.  For  example,  jobs 
requiring  60K  of  core  may  have  a  distribution  different  from 
those  jobs  requiring  130K  of  core. 


E.  Associated  Functions 

Associated  with  any  job 
o r  waiting  cost  function.  It 
value  of  service  given  a  job, 
to  completion  after  a  certain 


in  a  system  is  a  loss  function 
indicates  the  decrease  in  the 
from  an  immediate  completion 
time  interval.  Less  functions 
storing  information  relevant 
external  priorities,  upon 
upon  other  external 
during  the  servicing  of  a 

,  its  loss  function  is 
cf  loss  functions  are: 


are  dependent  upon  the  cost  of 
tc  a  request  being  serviced,  upon 
the  usefulness  of  the  result,  and 
factors.  If  information  created 
job  can  be  retained  at  negligible  cost 
monotone  non-decreasing.  Three  types 
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(1)  linear.  The  less  is  proportional  to  the  time  of 
the  job  in  the  system. 

(2)  convex.  The  second  derivative  cf  the  loss 
function  is  nennegative. 

(3)  concave.  The  second  derivative  of  the  loss 
function  is  nonpositive. 


Sch 

eduling 

disciplines  can 

be 

classified  according 

to 

the 

loss 

function 

s  they  presume. 

S 

PT/c  assumes  a 

linear 

loss 

function  is 

associated  vith 

ea 

ch  job.  It  is 

dif f ere 

n  t 

from 

SPT 

in  that 

in  choosing  the 

hi 

ghest  priority 

job  to 

be 

serviced,  the  smallest  ratio  of  the  processing  time  of  a  job 
to  its  less  function  value,  c,  is  considered,  rather  than 
the  shortest  processing  time.  SPRT/c,  SEPT/c,  SFHPT/c  are 
defined  similarly. 

C .  Distributions 

Other  factors  influencing  scheduling  disciplines  are 
interarrival  distributions  and  processing  dist r ibut ions. 
These  distributions  may  assume  various  shapes.  Pelow  are 
given  the  density  and  cumulative  distribution  functions  of 
the  theoretical  distributions  considered  in  this  study.  All 
of  the  functions  have  values  of  zero  vhen  t  is  negative. 
Figure  1  gives  the  general  shape  cf  the  distributions. 
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(1)  Exponential  distribution.  Its  density  function 


is  given  as: 


f  (t)  =  Ae 


-At 


for  t  >  0,  and  cumulative 


distribution  function  as: 

F  (t)  =  1  -  e"At 


for  t  >  C 


(2)  Hyperexponential  distribution.  This  distribution 
is  the  sum  of  exponentially  distributed  random  variables. 

The  density  function  for  the  two  parameter  case  is: 

f  (t)  =  p  ^e  ♦  (1—  p)  {  A2e  for  t  >  0, 


The  cumulative  distribution  function  is: 


F  (t)  =  1  -  pe  ^  -  (l-p)e'^ 


for  t  >  0. 


(3)  Erlang-k  distribution.  The  density  function  is: 


f(t)  =  (kA)^  ) 

fR-TTf 


for  t  >  0, 

and  k  an  integer  >  1.  The  cumulative  distribution  function 


is : 


-£ 


F (t)  =  \  f (x)  dx 

o 


for  t  >  0. 


This  function  is  too  complex  to  be  expressed  in  closed  form, 


(4)  Gamma  distribution.  The  density  function  is 
f(t)  =  t(A-l)  e't/B  for  t  >  C 

T7A)  BA 
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where  r>) 


- 


at 


for  all  A  >  0 


The  cumulative  distribution  function  is 


F (t)  =  \  f {x)  dx 
'o 


for  t  >  0. 


The  exponential  distribution  is  a  special  case  of  the  gamma 
distribution  when  A  =  1.  The  Erlang-k  distribution  is  also 
a  special  case  of  the  gamma  distribution  where  A  is  an 
integer  >1  (k  =  A,  A=  1/ (AE)  where  k  and  A  are  the 
parameters  of  the  Erlang-k  distribution) . 


(5)  Seibull  distribution.  The  density  function  is 

f  < t)  =  (A/B)e  (t/E)  if  t  >  0,  A  >  0, 

E  >  0. 


The  cumulative  distribution  function  is: 


Mt)  =  1 


e 


if  t  >  0,  A  >  C, 


B  >  C. 


The  exponential  distribution  is  a  special  case  of  the 
Weibull  distribution  when  A  =  1. 

y 

(6)  Lognormal  distribution.  A  random  variable  Y  =  e 
in  which  X  is  normally  distributed  with  parameters  m  and  or 
is  said  to  have  a  lognormal  distribution  with  parameters  m 
and 
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(7)  Beta  distribution.  The  density  function  is: 
f(t)  =  (f(AtE)  )/ (T’(A)  T(E>  )  (1-t)8'1 

for  C  <  t  <  1. 

The  cumulative  distribution  function  is: 

c* 

F (t)  =  \  f < x )  dx  for  0  <  t  <  1. 

The  density  and  cumulative  distribution  functions  have  value 
zero  when  t  >  1.  It  should  be  noted  that  the  Eeta 
distribution  is  finite. 

C .  Distribution  Classifications 

Service  time  distributions  can  be  characterized  by: 

(1)  Failure  Rate(FR).  This  is  the  probability  that  a 
job  with  a  certain  amount  of  attained  service  will  complete 
in  the  next  instant.  Failure  rate  can  be: 

(a)  increasing,  e.g.,  FR  of  an  Erlang-k 
distribution, 

(b)  decreasing,  e.g.,  FR  of  a  hyperexponential 
or  a  Weibull  distribution, 

(c)  constant,  e.g.,  FB  of  an  exponential 


distribution 
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(See  Figure  2.)  It  is  defined  as: 

FR  (t)  =  f  (t)/ {1-F  (t) )  where  f  (t)  and  F(t)  are 

the  density  function  and  cumulative  distribution  function, 
respectively. 

(2)  Mean  Residual  life  (MRI) .  This  is  the  expected 
remaining  service  time  of  a  job  when  it  has  attained  a 

certain  amount  of  service,  defined  as: 

p  oo  poo 

MRI  ( t )  =  (  \  (x-t)  f  (x)  dx)/ (  \  f(x)dx) 

t  t 

where  f (x)  is  the  distribution  function.  It  may  be: 

(a)  increasing,  e.g.,  MRL  of  a  hyperexponential 
or  a  Weibull  distribution, 

(b)  decreasing,  e.g.,  MRL  of  an  Frlang-k 
distribution , 

{ c )  constant,  e.g.,  MRL  of  an  exponential 
d istr ibu ticn. 

(3)  Rank.  Its  dynamics  are  the  same  as  these  for 

MRL. 

Source  of  the  Problem 

Concern  over  the  manner  in  which  jobs  pass  through  a 
computer  system  has  led  to  the  development  of  a  number  of 
scheduling  disciplines.  Eefore  the  concepts  of 


FIGURE  2.  FAILURE  RATES  OF  THEORETICAL  DISTRIBUTIONS 
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mu ltiprogramming  ard  preemption  were  realized, 
cculd  be  handled  by  the  operating  system  at  any 
Other  arriving  jobs  remained  in  queues  until  th 
had  been  processed  to  completion.  Scheduling  d 
were  relatively  simple  and  easy  tc  analyze. 

Current  systems,  however,  allow  more  than 
task  at  any  one  time  and  the  interruption  and  1 
resumption  cf  the  processing  of  a  job.  Schedul 
become  difficult.  The  central  processor  as  cpp 
input/output  devices,  is  still  often  a  scarce  r 
more  than  one  processor  may  be  in  operation. 

The  difficulty  of  scheduling  is  illustrat 
by  the  number  of  objectives  which  must  be  kept 
Seme  of  these  are: 


(1) 

mi 

nimum 

responi 

~e  time 

(2) 

ma 

x  i  m  u  m 

system 

throughput 

(3) 

re 

duce  d 

number 

of  jobs 

in  the  system 

(4) 

ac 

knowle 

dgeraen 1 

t  of  job 

priorities 

(5) 

lo 

w  cost 

of  implemental 

t  icn. 

only  one  job 

one  time, 
e  current  job 
iscipli nes 

cne  ongoing 
a  ter 
ing  has 
csed  tc 
esource  but 

ed  somewhat 
in  mind. 


To  determine  how 
disciplines  have 
following  ways: 
and  (3)  empiric 
will  categorize 


well 

the 

se  objectives 

are 

me 

t. 

teen 

s  tu 

died  in  one  cr 

mo 

re 

cf 

(1) 

ana 

lytically,  (2) 

thro 

ugh 

ally. 

In 

the  remainder 

cf 

th 

is 

the  sched 

uling  discipli 

nes 

in 

tc 

scheduling 
t  he 

simulation 
chapter  we 
three 
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classes  and  then  in  each  discuss  the  research  conducted 
employing  these  methods. 

Types  of  Disciplines  Formulated 

For  research  purposes,  scheduling  rules  have  often 
been  distinguished  according  to  the  amount  and  kind  of 
information  about  a  job  needed  to  apply  the  scheduling  rule. 
There  are  basically  three  classifications: 

(1)  those  rules  that  require  no  information  about  the 
processing  times  cf  jobs, 

{2)  those  rules  that  require  exact  information  about 
the  processing  times  of  jobs, 

(3)  those  rules  that  require  knowledge  cf  the 
probability  distributions  of  the  processing  times  of  jobs. 

The  first  class,  i.e.,  those  rules  that  require  no 
information  about  a  job,  can  be  subdivided  into  classes  that 
use  no  information  gained  about  a  job  during  its  processing, 
and  those  that  do.  In  the  first  group  are  FIFO,  LIFO, 

F LIFO ,  and  RAND.  None  of  these  discriminates  among  jobs  on 
the  basis  of  their  length.  In  the  second  group  are  RB,  PS, 

P PS ,  and  FEn.  These  disciplines  do  discriminate  among  jobs 
by  allowing  short  jobs  to  be  completed  before  longer  jobs. 

The  second  class,  i.e.,  those  rules  that  require  exact 
information  about  the  processing  times  cf  jobs,  can  be 
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subdivided  into  ncn-preempti ve  and  preemptive  disciplines. 
These  include  SPT  and  SRPT,  respectively.  Foth  process  jobs 
in  the  order  of  their  length,  shortest  to  longest. 

The  third  class,  i.e.,  those  rules  that  require 
knowledge  of  the  probability  dist ribut ions  of  the  processing 
times  of  jobs,  can  also  be  subdivided  into  non- preemptive 
and  preemptive  disciplines.  The  first  includes  SEPT;  the 
second  includes  SERPT  and  SR. 

Analytical  Studies 

Assumptions  and  Performance  Measures  of  Scheduling  Models 

Scheduling  disciplines  in  each  of  the  three  groups 
have  been  studied  under  various  constraints  and  with  various 
performance  measures  in  mind.  Coffman  {1967),  Coffman  & 
Kleinrock  (1968) ,  and  McKinney  (1969)  have  provided  surveys 
of  the  models  used  and  results  obtained  analytically,  prior 
to  1969.  McKinney  (1969)  has  pointed  out  that  scheduling 
models  of  time-sharing  systems  are  stochastic  and  thus  the 
models  depend  on  queuing  theory.  Models  may  differ  from 
each  other  in  the  following  parameters: 

(1)  number  of  scarce  resources,  usually  the 
processor , 

(2)  distribution  of  job  arrivals,  e.g.,  general  or 
Pcissc  n. 
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(3)  handling  of  overhead  { preem pticn)  , 

(4)  quantum  assumption, 

(5)  processing  time  distribution,  e.g.,  exponential 
or  general,  and  degree  of  knowledge  of  this  distribution, 

(6)  setup  times, 

(7)  external  priorities,  and 

(8)  scheduling  discipline. 

Measures  of  performance  obtained  from  these  models  include: 


(1) 

average 

response  time. 

(2) 

average 

waiting  time. 

(3) 

average 

queue  length. 

(4) 

maximum 

waiting  time,  and 

(5) 

weighted 

response  times  (linear 

Simple  models  have  been  found  to  portray  accurately  actual 
time-sharing  system  performance  (Scherr  1567) . 

Optimality  of  Scheduling  tisciplines 

An  important  area  has  been  the  optimality  of 
scheduling  disciplines  in  the  various  models.  This 
optimality  may  be  examined  from  the  viewpoint  of  minimizing 
total  expected  losses  which  is  the  same  as  maximizing  the 
value  of  service  rendered.  We  shall  outline  seme  of  the 
optimality  results  that  have  been  obtained  through  research. 
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on  each  of  the  three  classifications  of  scheduling 
disciplines  discussed  earlier. 

For  the  class  of  scheduling  disciplines  with  no 
initial  information  about  the  processing  times  cf  jobs, 
Schrage  (1S70)  has  found  that: 

(1)  if  no  preemption  is  allowed, 

(a)  FIFO  is  optimal  for  identical  convex  costs 
(i.e.,  every  jet  has  the  same  loss  function 
which  in  this  case  is  convex)  and  arbitrary, 
independent  interarrival  and  processing  times, 
and 

(2)  if  preemption  is  allowed, 

(a)  PFS  is  optimal  for  identical  linear  costs 
arbitrary,  dependent  interarrival  times,  and 

a  decreasing  failure  rate, 

(b)  FIFO  is  optimal  for  identical  convex 
costs,  arbitrary,  dependent  interarrival  times, 
and  an  increasing  failure  rate, 

(c)  PLIFO  is  optimal  for  identical  ccncave 
costs,  arbitrary,  dependent  interarrival 
times,  and  exponential  processing  times,  and 
(<3)  RE  is  good  but  not  necessarily  optimal 
for  identical  convex  costs,  arbitrary, 
dependent  interarrival  times,  ard  a  decreasing 


failure  rate. 
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For  the  class  of  scheduling  disciplines  with  exact 
information  cn  the  service  times  cf  jobs.  Smith  (1956,  cited 
in  Sevcik  1971)  and  McNaughton  (1959)  among  others  have 
proven  SPT/c,  where  c  is  a  loss  constant,  optimal  with 
preemption  disallowed  and  no  arrivals.  With  preemption 
allowed  and  identical  linear  loss  functions,  Schrage  (1968) 
has  proven  SFPT  optimal.  SRPT/c  is  after  the  last  arrival 
of  a  busy  period  when  preemption  is  allowed.  Under  Poisson 
arrivals,  SRPT/c  is  optimal  in  minimizing  the  expected  rate 
of  total  loss  (as  opposed  to  total  losses)  (Sevcik  1971) . 

Finally,  we  consider  the  class  of  scheduling 
disciplines  with  Knowledge  of  the  service  time 
distributions.  Rothkopf  (1966)  and  others  have  proven 
SEPT/c  optimal,  when  preemption  is  disallowed  and  there  are 
nc  arrivals.  Sevcik  (1971)  has  proven  SR  optimal  in 
minimizing  the  expected  rate  cf  total  less  under  linear 
losses  and  Poisson  arrivals,  within  a  class  of  disciplines. 
It  is  conjectured  that  the  optimal  rule  must  lie  within  that 
class. 

In  the  above  cases,  SR  has  been  shown  to  specialize  to 
the  optimal  scheduling  discipline. 

For  more  general  models  than  these,  scheduling 
problems  become  sc  complex  that  few  optimality  results  have 


been  obtained. 
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These  three  classes  of  scheduling  disciplines  have 
been  investigated  as  to  their  performance  under  variables 
such  as  interarrival  times,  processing  time  distributions, 
utilization  of  the  system,  and  loss  functions.  We  will  new 
present  seme  of  the  important  studies  in  this  area. 

Comparative  Performance  of  Optimal  cr  Good  Scheduling 

Disciplines 

Schrage  (1969) ,  Tran  (1972)  ,  and  Schrage  5  Hiller 
(1966,  cited  in  Schrage  1969)  have  investigated  the  effects 
of  various  conditions  on  scheduling  disciplines.  Schrage 
has  considered  a  system  with  a  single  limited  resource,  the 
processor,  exponential  interarrival  times,  and  identical 
linear  loss  functions.  He  has  compared  the  following 
scheduling  disciplines:  FIFO,  SPT,  SBPT,  HR,  and  FB  with 
respect  to  the  overall  average  response  time  and  system 
lead,  and  has  found  that  FIFO,  FR,  and  FB  perform 
eguivalen tly,  but  that  the  rules  using  information  about 
processing  times,  SPT  and  SFPT,  improve  system  performance 
greatly  when  a  system  is  heavily  loaded.  In  addition,  he 
has  investigated  the  effect  of  an  increasing  coefficient  of 
variation  and  has  found  that  full  information  (SPT/c  versus 
FIFO)  and  preemption  (FR  versus  FIFO  and  SRPT  versus  SPT/c) 
become  more  valuable.  FIFO  and  FE  are  very  distribution 
sensitive,  with  SPI/c,  RR,  and  SRPT,  less  so. 
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Tran  (1972)  has  simulated  a  system  under  the 
assumptions  of  a  single  processor,  Poisson  arrivals,  linear 
less  functions,  with  and  without  overhead  cf  preemption,  and 
hyperexponential  and  bimocal  normal  processing  time 
distributions.  He  compares  three  families  of  scheduling 
disciplines:  (1)  SPT,  including  preemption,  (2)  PR, 

including  variations  such  as  SQRR,  CYC RR ,  and  MQRR ,  and  (3) 
FEn,  with  variations  on  the  value  n,  with  respect  to  mean 
response  time,  waiting  time  distribution,  and  response  time 
distribution  conditioned  on  service  time.  His  results 
reveal  that  the  SFT  family  is  dominant  as  the  scheduling 
strategy  producing  the  "best”  mean  response  times,  waiting 
time  distributions,  and  response  time  distributions  both 
with  and  without  overhead.  They  also  indicate  that  response 
time  distributions  are  significantly  affected  by  service 
time  distributions.  Tran  concludes  that  (1)  the  optimality 
of  disciplines  depends  a  great  deal  on  the  available 
information  about  service  times  (but  not  on  the  type  cf  the 
particular  service  time  distribution,  i.e.,  here, 
hyperexpenent ial  versus  bimodal  normal)  and  (2)  overhead 
costs  do  cause  significant  variation  in  the  relative 
performance  of  scheduling  disciplines. 


Schrage  and  Miller  have  used  information  about 
processing  times  indicating  which  jobs  are  short  and  which 
are  long.  They  apply  external  priorities  favoring  the  shert 
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Service  Time  Distribution  Studies 

Analytical  and  simulation  studies  primarily  assume 
that  service  time  distributions  have  general  exponential 
forms.  Studies  by  Greenberger  (1966) ,  Scherr  (1967) ,  and 
Kleinrock  (1967) ,  among  many  others,  have  assumed 
exponential  service  time  distributions. 


Tran 


(1972)  has 
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assumed  hyperexpcnential.  (He  also  considers  a  bimodal 
normal  distribution.) 

In  actual  empirical  studies  of  service  time 
distributions,  general  exponential  forms  do  appear,  but  not 
the  strictly  exponential  form  assumed  in  sc  many  models. 
Shemer  6  Heying  (1969)  have  studied  a  batch  time-sharing 
system  of  an  SDS  Sigma  5  and  Sigma  7  to  obtain  empirical 
support  for  the  validity  cf  their  proposed  mathematical 
model.  Without  displaying  any  of  the  service  time  results, 
they  assert  that  the  empirical  data  does  indicate  that 
exponential  service  time  distributions  is  a  valid  assumption 
for  the  model. 

However,  Mclsaac  (1965,  cited  in  Schrage  1969) ,  in  an 
S DC  study  of  an  IEH  7094-type  time-sharing  computer  has 
found  that  reguest  time  distributions  could  be  approximated 
by  a  log-normal  distribution.  The  coefficient  cf  variation 
for  the  reguest  times  in  this  study,  is  large,  viz., 
17.45/1.45.  Much  variability  exists  among  jobs  and  the 
distribution  is  skewed  to  the  right. 

Scherr  (1966)  has  studied  an  MAC  time-sharing  system 
on  an  IEM  7094  and  found  that  the  distribution  of  reguest 
times  has  a  mean  cf  .88  and  standard  deviation  of  .7.  The 
distribution  has  a  greater  probability  near  zero  than  has  an 


exponential  distribution  with  mean  .88.  Nc  fitting  of  the 
empirical  distribution  was  performed. 

In  a  later  SIC  study.  Fine,  Jackson,  S  Fclsaac  (1966) 
have  studied  the  number  of  instructions  executed  per 
request.  They  partition  requests  according  to  the  specific 
purpose  of  the  program,  considering  LISP,  FETA5,  GPDS,  TINT, 
and  STJHF  programs.  While  not  actually  studying  the 
distribution  of  service  times,  they  do  study  the  variability 
of  the  number  of  instructions  executed  per  reGuest.  Their 
results  indicate  that  coefficients  of  variation  range  from 
.57  to  2.3.  The  statistics  show  distinct  differences  among 
the  types  of  programs. 

A  JOSS  study  (Bryan  1967)  of  an  interpretive  system  on 
a  PPP-6  has  revealed  that  the  distribution  of  reguest  times 
per  session  has  a  by perexpcnen t ial  form  with  the  mode  at 
zero.  The  coefficient  of  variation  was  not  given  but  the 
median  was  7  seconds  while  the  mean  was  6.6  minutes  on  a 
typical  distribution,  showing  much  variability  in  processing 
time  of  requests. 

Fuchs  6  Jackson  (1970)  have  proposed  an  elementary 
model  of  the  communication  process  between  a  multiaccess 
computer  and  a  user  at  a  terminal.  They  support  this  model 
with  empirical  data  frcm  four  computer  systems, 
investigating  the  distribution  of  user  think  times,  computer 
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burst  times,  interburst  times,  in tercharacter  times,  and 
related  discrete  variables.  Using  parameters  estimated  from 
sample  means  and  variances,  they  have  found  that  all  of  the 
continuous  variables  can  be  represented  by  gamma 
distributions  in  all  four  systems,  and  some,  namely,  the 
user  think  time,  computer  burst  time,  and  interburst  time 
distributions,  can  be  alsc  represented  by  log-normal 
distributions.  However  their  computer  burst  times  included 
processing  times  plus  time  spent  in  queues  and,  in  seme 
cases,  times  between  terminal  “carriage  returns”  and  “line 
feeds”.  No  processing-ti me-cnly  distributions  were 
obtained . 

Anderson  (1972)  has  examined  the  service  time 
distributions  of  an  IBM  API  interactive  system  on  a  360/50 
with  the  intent  of  combining  these  and  other  empirical 
results  with  theoretical  research  tc  improve  system 
performance  through  an  improved  scheduling  algorithm.  He 
obtains  an  overall  distribution  of  user  input  and  then 
distributions  of  the  input  partitioned  intc  three  classes: 
compute  transactions,  commands,  and  output-bound 
transactions.  His  statistics  show  wide  variation  in  all  of 
the  classes.  The  overall  distribution  has  a  coefficient  cf 
variation  of  9.54,  and  the  three  partitions  have 
coefficients  of  variation  of  11.86,  6.57,  and  3.23, 
respectively.  All  have  long-tailed  distributiens. 


He  also 
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plots  the  mean  residual  service  time  distributions,  the 

concentration  curves,  and  the  completion  rates.  The  mean 

residual  life  is  increasing  for  all  input  classes.  This  is 

followed  by  attempts  to  fit  the  service  time  distribut ions 

« 

to  hyperexponential  and  Weibull  distributions,  using 
cumulative  proportions.  No  hyperexponential  distribution  is 
fit  to  the  data  since  its  parameters  cannot  be  fit 
satisfactorily,  by  the  method  of  moments.  The  Weibull 
distribution  is  used  with  parameters  estimated  ty  a 
regression  method  developed  by  Kac  (1960,  cited  in  Andersen 
1972).  No  single  Weibull  distribution  is  fit  but  a 
composite  of  Weibull  distributions  does  approximate  the 
empirical  distributions  fairly  closely,  although  not 
sufficiently  well  enough  to  satisfy  a  goodness-of-f it  test. 

Implications  of  Empirical  Studies 

Empirical  service  time  distributions  have  resembled 
hyperexpenen tial,  gamma,  Weibull,  log-normal  and  Waring 
distributions.  These  distributions  are  similar  in  that 
their  mean  residual  life  times  and,  consequently,  their 
ranks,  are  increasing  throughout  or  increasing  after  seme 
point  in  time.  This  can  be  attributed,  in  part,  to  the 
infinite  tails  of  the  distributions.  Sevcik  (1971)  points 
out  that  it  is  possible  to  base  service  time  distributions 
used  in  applying  a  scheduling  discipline  in  a  particular 
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system  on  past  empirical  studies  of  that  system.  Therefore, 
we  must  consider  the  effects  cf  these  distributions  on 
scheduling  disciplines.  When  rank  is  decreasing,  preemption 
is  not  necessary  to  obtain  minimal  expected  loss.  But  when 
rank  is  increasing,  preemption  is  desirable.  Costs  of 
preemption  may  become  significant  in  tie  choice  cf  a 
scheduling  discipline,  extremely  so  in  the  case  of  PS  and 
PPS,  less  so  in  FF  and  FB,  and  still  less  so  in  SEHPT. 

> 

One  other  observation  of  empirical  studies  is  of  major 
importance  in  considering  the  applicability  cf  SR  or  any 
other  distribution-dependent  discipline.  In  most  of  the 
studies,  much  variability  in  the  processing  tines  of  jobs 
has  appeared,  evident  in  the  larce  coefficients  of  variation 
and  the  very  long  tails  of  the  distributions.  Co nseguent ly , 
moments  of  the  distributions  are  guite  inaccurate. 

Occurences  in  the  tail  can  significantly  alter  the  mean, 
standard  deviation,  skewness,  and  kurtosis.  Other 
statistics  used  to  describe  distributions,  such  as  failure 
rate  and  mean  residual  life  are  also  affected.  As  mentioned 
above,  mean  residual  life  may  be  increasing  when  it  is 
desirable  to  have  it  decreasing.  Sevcik  (1971)  conjectures 
that  by  partitioning  arriving  jobs  into  classes  with  ccmmcn 
a  priori  characteristics,  distributions  having  lower 
coefficients  of  variation  than  the  overall  distribution  may 
be  obtained.  Anderson’s  work  (1972)  discussed  earlier,  dees 
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indicate  this  effect,  although  his  research  partitioned  the 
overall  distribution  only  on  the  basis  of  the  type  of  time¬ 
sharing  command.  The  coefficient  of  variation  of  the  main 
distribution  was  reduced  in  two  of  the  three  partitioned 
classes.  In  the  event  of  the  coefficient  of  variation  not 
being  reduced  significantly,  if  distinct  distributions  are 
obtained  in  the  partitioned  classes,  more  accurate 
information  is  known  about  the  processing  time  distribution 
of  the  jobs  within  each  class.  To  determine  if  information 
known  at  the  arrival  of  a  job  about  that  job — information 
such  as  priority,  amount  cf  core  required,  type  of  user — 
affects  the  distribution  cf  the  processing  times  cf  the 
jobs,  directed  research  is  necessary. 

In  the  remainder  of  this  thesis,  this  partitioning 
concept  will  be  investigated  in  a  batch  processing  system  at 
the  Hniversity  of  Toronto.  Chapter  II  presents  the  method 
used  to  gather  and  process  data  and  the  parameters 
considered  in  partitioning  the  data.  Measurements  used  to 
determine  differences  in  distributions  are  outlined. 

Chapter  III  reviews  the  experiments  conducted  on  the 
data  gathered  and  the  results  obtained  frcm  these 
experiments.  Eistinctive  or  unusual  observations  are 


me  nti oned . 
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Chapter  IV  outlines  the  conclusions  reached  based  on 
the  results  of  Chapter  ITT- 
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CHAPTER  II 


Objectives  of  Our  Investigation 

By  collecting  a  large  amount  cf  information  concerning 
jobs  in  an  actual  system,  we  hoped  to  obtain  the  service 
time  distribution  of  these  jobs,  and  then  condition  this 
distribution  on  one  or  pairs  of  characteristics  of  the  jobs. 
Our  objective  was  to  determine  which,  if  any,  of  the 
parameters  would  be  indicative  cf  the  service  time 
distribution  of  a  job.  To  determine  scheduling  performance 
implications,  we  tried  to  characterize  the  empirical 
distr ibutions  by  attempting  to  fit  them  to  theoretical 
distributions  and  to  calculate  their  associated  functions 
(i.e.,  mean  residual  life  and  smallest  rark) . 

Methods  Employed  in  the  Processing  of _ the  Data 

The  nature  cf  our  problem  requires  the  processing  of 
large  amounts  of  data  to  obtain  frequency  distribu tions  and 
associated  statistics  and  then  the  comparison  of  these  to 
known  probability  distributions.  Various  methods  are 
employed  in  the  areas  of  empirical  distribution  fitting  and 
measurement  of  differences  in  empirical  distributions. 

Before  discussing  the  actual  data  handling  procedures,  we 
give  descriptions  of  these  methods  below. 
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Distribution  Pitting 


A .  Smoothing 

Before  empirical  distributions  can  be  tested  against 
theoretical  distributions,  extreme  fluctuations  in  the 
frequency  classes  must  be  eliminated  as  they  are  isolated 
occurrences  not  indicative  of  the  actual  population 
distribution.  These  fluctuations  are  often  referred  to  as 
"noise”.  To  eliminate  "noise",  a  numerical  method  known  as 
least  square  polynomial  approximation  can  be  used. 
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Yn  1/6  {  *  2f  n-2  +  5f^)  • 

This  method  may  be  adjusted  to  non-equidistant  points,  Xj 
,...,xn.  It  is  used  where  linearity  over  three  points  is  a 
good  assumption.  Although  a  function  may  not  be  linear, 
this  method  applied  over  subranges  of  the  data  would  smooth 
the  function  satisfactorily.  It  may  also  be  applied 
repeatedly,  obtaining  greater  smoothing  and  eventually  a 
straight  line. 

(2)  Five  point  linear  approximation.  For  formulas, 
see  Hildebrand  (1956).  This  method  in  application  is 
similar  to  the  three  point  linear  approximation.  However, 
its  effect  on  smoothing  is  more  pronounced.  It  should  be 
applied  only  when  linearity  over  five  points  is  a  good 
assumption . 

(3)  Approximation  by  a  polynomial  of  degree  greater 
than  one.  This  method  is  used  when  the  true  function  is 
thought  to  be  a  ncn-lineai  polynomial.  The  polynomial  may 
be  chosen  so  that  it  fits  the  observed  data  exactly,  but 
this  would  reflect  ”noise”.  Thus  a  polynomial  of  degree 
less  than  the  number  of  function  values  should  be  chosen, 
based  on  intuitive  knowledge  of  the  true  function. 

B .  Parameter  Estimation 

Empirical  distributions  can  generally  be  fit  to 
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theoretical  distributions  using  the  method  of  rocments  or  the 
method  of  maximum  likelihood  to  determine  the  parameters  of 
the  theoretical  distribution. 

The  method  of  moments  is  the  eldest  and  simplest 
method.  For  a  k  parameter  distribution,  it  consists  of 
determining  the  first  k  moments  of  the  theoretical 
distribution  which  are  generally  functions  of  the 
parameters,  and  the  first  k  moments  of  the  empirical 
distribution.  Then  by  equating  each  of  the  k  moments,  k 
simultaneous  equations  in  k  unknowns  are  obtained  which  can 
be  solved  for  the  k  parameters. 

The  method  of  maximum  likelihood  depends  on  obtaining 
a  value  of  the  unknown  parameter,  0,  which  maximizes  a 
likelihood  function,  I,  defined  as: 

I  =  P  (X2=X2 ) P  (X2  =  x2 ) . . .P (Xn=xn)  where 

X1,X2,...,Xn  is  a  random  sample  from  a  discrete  probability 
distribution  and  the  X-L*s  are  the  actual  values  of  the 
random  variables,  X*u.  For  a  continuous  distribution,  P(X^=x*) 
is  replaced  by  the  density  function  of  the  random  variable  X* 
at  the  sample  point  x  ^ .  This  method  can  be  said  to  obtain 
the  value  of  0  which  maximizes  the  probability  of  obtaining 
the  particular  sample.  For  further  discussion  of  this 
method  and  its  advantages,  see  Hillier  and  Lieberman  (1967). 


Parameter  estimation  methods  pertaining  tc  a  specific 
distribution  are  often  used.  One  to  estimate  the  parameters 
of  a  Weibull  distribution  has  been  developed  ty  Kao  (1960, 
cited  in  Andersen  1972),  based  on  the  following  property  of 
the  Weibull  distribution: 

W=ln  In  (1/ (1-F  (t)  )  )  =  A  In  t  -  A  In  E 

where  A  and  E  are  the  parameters  of  the  distribution.  By 

i 

letting  z  =  In  t,  x  =  A  In  E  and  y  =  A,  and  ty  performing  a 
regression  of  W  =  yz  -  x  on  z,  estimates  of  x  and  y  are 
obtained.  Then  solving  A  =  y  and  P  -  exp  (-x/y)  for  A  and 
R  yields  the  estimates  of  the  parameters  A  and  E. 

D.  Gcodness-of-Fit  Tests 


To  determine  how  well  an  empirical  distribution  is 
approximated  by  a  theoretical  distribution,  a  goodness-of- 
fit  test  can  be  performed.  One  such  test  commonly  used  is 
the  chi-sguare  test.  It  is  based  on  the  calculation  of  the 
differences  of  the  empirical  distribution  values  from  the 
theoretical  distribution  values.  For  large  samples,  the 
following 

^  (  (f 0  -  ffi)  /fe)  where  f0  is  the  empirical 

distribution  value  and  fe  is  the  theoretical  value,  for 
large  samples,  has  a  chi-sguared  distribution  with  k-1 
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degrees  of  freedom  where  k  is  the  number  of  terms  in  the 
sum.  If  it  is  necessary  to  calculate  the  parameters  cf  the 
theoretical  distribution  from  the  empirical  distribution, 
then  the  best  possible  estimates  of  the  parameters,  used  in 
the  same  manner  as  actual  parameter  values  would  be,  will 
result  in  a  chi-sguare  distribution  with  k-l-p  degrees  of 
freedom  where  p  is  the  number  of  estimated  parameters. 

i 

Measurements  of  Differences  in  Empirical  Distributions 

To  detect  differences  in  empirical  distributions,  the 
means,  standard  deviations,  and  coefficients  cf  variation 
{standard  deviat ic n/mean)  can  be  compared.  If  the 
distributions  do  not  have  very  long  tails,  skewness  and 
kurtosis  may  also  be  considered.  For  more  intuitive  and 
complete  information,  the  actual  shapes  of  the  distributions 
as  illustrated  by  the  cumulative  prcpcrticns  and  graphical 
representations,  can  be  examined.  In  this  regard,  the 
height  of  the  peak,  the  breadth  of  the  peak,  the  piopcrticn 
cf  jobs  lying  in  the  peak,  the  maximum  point  of  the  peak, 
and  the  fullness  and  length  of  the  tail  are  important. 

Data  Gathering  and  Handling 


Source  of  the  Data 
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The  data  for  this  research  was  obtained  from  the 
University  of  Toronto  accounting  records  over  a  thirty  day 
period  in  November  and  December  of  1372.  The  accounting 
records  contain  information  frcm  the  HASP/CS-MVT  system 
operating  on  an  TP?!  370/165.  HASP  manages  jots  from  three 
sources.  A  brief  description  cf  each  fellows. 

(1)  General  purpose  job  stream  (GPJS) .  This  is  a 
batch  processing  facility  that  handles  jobs  requiring 
utilities  and  language  processors  such  as  PL/I,  C070L, 
FORTRAN,  PPG,  and  Assembler.  A  job  is  submitted  with  a  job 
card  followed  by  job  control  language  (JCL)  cards,  providing 
information  required  to  process  the  job.  Much  cf  this 
information  and  additional  information  gathered  during  the 
processing  of  the  job,  is  stored  in  the  accounting  records. 
These  jobs  require  an  amount  cf  computer  time  limited  by 
user  class  specifications. 

(2)  High  speed  job  stream  (HSJS) .  This  is  a  batch 
processing  facility  that  handles  short  jobs  requiring 
special  high-speed  langauge  processors  such  as  tiATFIV, 

PLUTO,  ALGOLW ,  and  PL/C.  It  provides  very  fast  turnaround 
service  free  of  charge.  A  job  card  providing  very  limited 
information  is  submitted  with  a  job.  This  information  plus 
limited  information  gathered  during  the  processing  of  the 
job  is  stored  in  the  accounting  records. 
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(3)  Conversational 
a  system  which  interfaces 
remote  stations  linked  to 
communication  lines  to  be 


remote  job  entry  (CFJE) .  This 
with  RASP  to  allow  jobs  from 
the  IBM  370  via  low-speed 
submitted  for  batch  processing 


is 


by 


HASP. 


Description  of  the  Data  Gathered 

Accounting  information  is  stored  on  a  daily  basis, 
i.e. ,  one  24-hour  period  is  replaced  by  the  next  24-hcur 
period.  Thus,  when  a  24-hour  period  had  passed,  a  PL/I 
program  was  submitted,  which  examined  the  information  from 
this  time  period,  extracted  a  number  of  the  accounting  items 
of  interest  in  this  research,  and  stored  these  on  a  tape. 
This  was  repeated  until  over  30,000  jobs  from  the  general 
purpose  job  stream  had  accumulated. 

For  each  job  in  the  general  purpose  job  stream,  the 
following  information  was  recorded: 

(1)  priority.  A  user  submitting  a  jot  to  the  system 
requests  a  priority  of  4,  6,  or  8  for  that  job.  A  priority 
of  6  is  charged  at  a  normal  rate;  a  priority  of  8  is  twice 
the  charge  for  a  priority  of  6;  and  a  priority  of  4  is  .6 
times  the  charge  for  a  priority  of  6.  Priority  8  provides 
the  fastest  turnaround  time. 
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(2)  source.  Various  remote  terminals  exist  at  the 
University  and  in  surrounding  areas  from  which  HASP  jobs  are 
submitted.  These  jobs  are  assigned  a  two  letter  code 
corresponding  to  the  various  terminals. 

(3)  authorization  code.  Every  jcb  has  an  account 

authorization  letter  from  cne  of  the  following  areas:  (a) 

administration,  (b)  commercial,  (c)  undergraduate  course 
work,  (d)  graduate  course  work,  (e)  research,  (f)  other 

i 

universities,  (g)  other  than  university  or  commercial  but 
having  seme  affiliation  with  universities,  and  (h)  Computing 
Centre  staff  work. 

(4)  authorization  code  number.  Every  job  is  assigned 
an  account  number  along  with  the  account  letter.  These 
numbers  range  freir  1  to  2C00. 

(5)  number  of  cards  read  in. 

(6)  estimated  execution  time  for  the  job.  This 
includes  CPU  time  plus  the  time  spent  waiting  for  I/O  to  be 
accomplished . 

(7)  estimated  number  of  lines  to  be  printed. 

(8)  estimated  number  of  cards  to  be  punched. 

(9)  amount  charged  for  the  job  as  a  functicn  of  core 
allocation,  CPU  time  and  I/O  time.  If  CPU  time  and  I/O  time 
are  not  both  zero,  core  allocation  is  obtained  from  the 
amount  charged  for  the  job. 
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(10)  the  time  at  which  a  job  was  passed  to  the 
operating  system. 

(11)  the  class  of  a  job.  A  job  class  can  be  A,  E,  C, 
or  M.  A  class  A  job  has  constraints  of: 

(1)  <  200K  cf  core  (this  is  not  enforced) , 

(2)  <  5  minutes  total  run  time, 

(3)  no  tape  and/or  disk  mounts, 

(4)  <  15,000  printed  lines,  and 

t 

(5)  <  2000  cards  punched. 

A  class  E  job  has  constraints  of: 

(1)  <  200K  cf  core, 

(2)  <  5  minutes  total  run  time, 

(3)  1  7-track  tape  and/or  9-track  tape  and/or  one 
disk  mount, 

(4)  <  15,000  printed  lines,  and 

(5)  <  2000  cards  punched. 

A  class  C  job  has  no  constraints.  The  job  class  M  is  used 
primarily  by  Computing  Centre  staff  when  immediate 
turnaround  is  imperative,  usually  in  cases  dealing  with  the 
repair  or  maintenance  of  the  operating  system. 

Eli2<I£am_nescr  ijrtic  n 

A  PI/I  program  was  developed  to  process  this  data. 

The  objective  of  the  program  is  to  obtain  distributions  of 
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the  processing-  ti ses  (i.e.,  CFD  times,  and  hereafter 
referred  to  as  CPU!)  of  jobs,  and  since  we  are  dealing  with 
a  mu  1 1 iprogram m ing  environment,  distributions  of  the  total 
run  times  {i.e.,  CPU  time  plus  I/O  time,  and  hereafter 
referred  to  as  TRT)  of  the  jobs.  These  distributions  are 
conditioned  on  zero,  one,  or  two  of  the  accounting 
informa tion  items  outlined  previously. 

The  program  consists  of  the  following  sections  or 
procedures : 

(1)  Determination  of  the  CPUT  interval  or  TRT 
interval.  Only  general  purpose  job  stream  jots  are 
considered.  High  speed  job  stream  jobs  are  eliminated 
because  CPU  times  and  I/O  times  are  net  recorded. 

(2)  Determination  of  the  conditioning  parameter 

class . 

(3)  Tabulation  of  the  appropriate  frequency  matrix 
class  (determined  by  (1)  and  (2)  above) .  These  classes  are 
in  the  form  of  buckets.  P  bucket  depends  upon  the  number  of 
conditioning  parameters  and  the  nature  of  the  parameter, 
i.e.,  discrete  or  continuous.  By  discrete  we  are  referring 
to  parameters  whose  values  cannot  be  grouped  through 
interval  classification,  e.g.,  priority  and  class.  By 
continuous,  we  are  referring  to  parameters  whose  values  can 
be  grouped  through  interval  classification.  £  bucket  can 
represent : 
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(a)  a  C PUT  or  TFT  interval, 

(b)  a  CPUT  or  TBT  interval  and  one  {or  two) 
discrete  parameter  value  (s)  , 

(c)  a  CPUT  OR  TRT  interval  and  one  (or  twc) 
parameter  interval  {s)  ,  or 

(d)  a  CPUT  or  TPT  interval,  one  discrete 
parameter  value,  and  one  parameter  interval* 

(4)  Smoothing  of  the  frequency  distributions.  A 

i 

three-point  linear  approximation  technique  applied  twice  in  * 
succession,  is  used.  This  is  considered  a  good  method 
because  in  areas  of  distribution  peaks,  the  function  values 
are  spread  over  small  intervals,  and  at  larger  intervals, 
i.e.,  the  tails  of  the  dist r ibut ions,  the  shape  of  the 
distributions  is  generally  linear. 

(5)  Distribution  statistics.  These  include  the  mean, 
standard  deviation,  skewness,  and  coefficient  of  variation, 
calculated  using  grouped  data  methods. 

(6)  Proportion  distribution.  The  proportion  in  each 
frequency  interval  and  the  cumulative  proportion  up  to  and 
including  that  interval,  are  calculated. 

(7)  Distribution  characterization.  The  mean  residual 
life  and  rank  of  each  distribution  are  calculated. 

(8)  Chi-square  test. 

(9)  Theoretical  distribution  calculations.  These 
deal  with  the  exponential,  hyperexponential,  and  gamma 
distributions.  The  parameter  estimates  are  obtained  as 
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fellows: 

(a)  exponential:  Va  =  mean  of  empirical 
distribution.  This  is  a  maximum  likelihood  estimator. 

(b)  h y perexpenent ial :  the  method  of  moments  is 
used  to  calculate  and  A2.  The  estimates  are: 

=  (4pm  ±  16p2m2  -  8p  (2m2  -  (1-p)  (sd2  ♦  m2))/ 

(4m2  -  2(l-p)  (sd2  4-  m2)  ) 
and  =  (  Ajll-p)  )/ (A1n-p) 


where  m  and  sd  are  the  mean  and  standard  deviation, 
respect! vely ,  of  the  empirical  distribution.  p  assumes 
values  from  0  to  1. 


obtain 


(c)  gamma:  the  method  of  moments  is  used  to 


2  2  2 

A  =  m  /  sd  and  E=sd/ra. 


Watching  of  the  mean  and  maximum  rather  than  the  mean  and 
variance  produces 

A  =  m/  (m-max)  and  E  =  m-raax 

where  max  is  the  point  at  which  the  empirical  distribution 
attains  its  maximum.  These  are  also  considered  in  the 
program. 


Seme  I nplemen tat ic n  Considerations  of  the  Program 
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The  data  frcra  the  accounting  tape  is  in  bit,  character 
or  fixed  binary  form.  To  process  this  information  in  an 
efficient  manner,  overlay  storage  techniques  are  used  which 
enable  comparison  on  a  bit  basis  rather  than  a  byte  or  word 
basis . 

The  frequency  distribution  is  in  the  form  of  an  m  by  n 
matrix  where  m  is  the  CPUT  or  TRT  interval  and  n  indicates  a 
conditioning  parameter  class.  If  there  is  only  one 
conditioning  parameter,  then  the  columns  range  frcra  1  to  the> 
number  of  classes  for  that  parameter.  If  there  are  t%o 
parameters,  then  the  columns  range  frcra  1  to  pg  where  p  is 
the  number  of  classes  for  the  first  parameter  and  q  is  the 
number  of  classes  for  the  second  parameter.  Cther 
calculations  are  in  table  form. 

Program  Output 


A  typical  run  would  produce  the  following 


FREQUENCY  FOR  CLASS  3  IS  2508 

MAXIMUM  IS  15.50 

M FAN  FOR  CLASS  3  IS  409.72 

STANDARD  DEVIATION  FOP  CLASS  3  IS  3543.90 

SKEWNESS  FOR  CLASS  3  IS  19.29 

COEFFICIENT  OF  VARIATION  IS  8.6495 
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VALI1F 

W  T  T  [I  3  3  D  .  C  . 

F  .  IS  6 5.86 

The  above 

table 

is  the  CPU? 

distribution  for  a 

Core 

Allocation 

clasj 

i  of  40-6 OK. 

The  SMED  FREQ  column  presents 

the  valuer  resulting  from  smoothing  the  distribution 
represented  in  the  FREQ  column.  Results  are  in  hundredths 
of  seconds.  Because  such  tables  are  difficult  to  read,  we 
represent  the  results  in  the  form  of  graphs  which  depict  the 

> 

variations  in  the  distributions. 
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CHAPTER  III 


Experiments  and  Results 

Various  experiments 
resulting  distributions, 
general  topics: 

(1)  representative 

(2)  best  partition 

(3)  distribution  f 

(4)  distribution  c 

Results  are  presented  wit 
general  observations  base 
presented  under  that  topi 

Representative  Distributi 

Ey  representative  d 
that  meaningfully  reflect 
or  TRT  on  zero,  one,  or  t 
representative  distributi 
in  two  areas: 

(1)  interval  adjus 

(2)  interval  adjus 
parameter  classes. 


were  conducted  on  the  data  and  the 
They  are  discussed  under  four 

distributions, 

s  t 

itting,  and 
haracterizations. 

h  the  experiments.  A  section  of 
d  on  the  results  in  (1)  is 

c. 

cns 

istributions  we  mean  distributions 
the  effects  of  conditioning  CPUT 
wo  parameters.  To  obtain  these 
cns,  we  first  conducted  experiments 

tment  involving  CPUT  and  TRT, 
tment  involving  continuous 


an  d 
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These  are  discussed  below. 

(1)  Interval  Adjustment  Involving  CPUT  and  TRT 

Our  objective  in  adjusting  these  intervals  was  tc 
obtain  maximum  information  of  the  distribution  of  the  jobs, 
e.g.,  the  peak(s)  of  the  distribution,  and  yet  to  maintain  a 
manageable  number  of  classes.  Our  method  involved  first 
obtaining  the  upper  limits  of  CPU T  and  TFT  and  then 
selecting  initial  intervals.  An  interval  for  times  of  zero 
seconds  was  included  tc  isolate  jobs  that  are  terminated 
before  requiring  any  processor  time  (due,  most  likely,  to  a 
JCL  error) .  The  number  of  intervals  for  the  CPUT  and  TFT 
were,  initially,  17  and  18,  respectively.  This  was 
increased  gradually  to  obtain  a  more  equal  distribution  over 
all  the  intervals,  through  increasing  the  number  of 
intervals  in  areas  of  great  concentration  of  jobs. 

A  final  set  of  43  intervals  for  both  CPUT  and  TFT 
emerged.  This  handles  reasonably  well,  all  significant 
distribution  classes  (freguency  greater  than  50)  resulting 
from  unconditioned  or  conditioned  CPUT  and  TFT,  fairly  well. 
Those  classes  with  an  extremely  high  number  of  observations 
are  well  distributed  over  the  intervals.  Those  classes  with 
an  extremely  lew  number  of  observations  (50  tc  100)  are 
distributed  relatively  evenly,  although  seme  intervals  may 
net  ccntain  any  jobs.  Classes  of  frequency  less  than  50  are 
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not  considered  to  be  conclusive  d istr ibut ions  of  that  area 
of  a  total  population.  Any  significantly  greater  number  of 
intervals  is  felt  to  hamper  visualization  of  the  general 
shape  of  the  distributions. 

The  decision  to  have  only  one  set  of  intervals  for 
both  CPUT  and  TBT  was  made  primarily  to  aid  comparison  of 
the  distributions  for  each  of  these  variables.  This  has 
little  effect  on  the  actual  distributions  or  relevant 
statistics.  Those  intervals  of  lew  CPOT  with  high 
incidences  of  jobs,  contain  fewer  jobs  in  TBT  distributions. 
Those  intervals  with  high  incidences  of  jots  in  TRT 
distributions,  contain  fewer  jobs  in  CPUT  distributions. 

{2)  Interval  Adjustment  Involving  Continuous  Parameter 
Classes 

Our  objectives  in  adjusting  these  intervals  were: 

(a)  to  locate  any  differences  in  the  CPUT  and 
TBT  distributions  conditioned  on  these  intervals, 

(b)  to  obtain  relatively  even  total  freguencies 
in  each  class,  and 

(c)  again  to  maintain  a  manageable  number  of 

classes. 

To  illustrate  the  problem,  consider  CPUT  distributions 
conditioned  on  the  amount  of  core  allocated.  A  core 
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allocation  class  cf  100-120K  produces  a  hyperexjicnontial- 
like  distribution.  However,  if  the  class  interval  is 
increased  to  100-150K,  the  distribution  becomes  somewhat 
gamma-shaped  where  the  mode  is  removed  from  the  first 
interval.  And  if  the  class  interval  is  decreased  to  100- 
1 1  OK ,  then  a  small  total  class  frequency  indicates  an 
unknown  distribution. 

Our  method  was  similar  to  that  in  section  (1). 
Estimates  of  the  average  value  and  the  upper  bound  of  the 
parameters  were  initially  available.  With  these  two 
guidelines  in  mind,  the  initial  set  of  intervals  was  chosen. 
After  a  number  of  runs  using  these  intervals,  they  were 
modified  in  line  with  our  objectives.  Table  I  below  gives 
the  upper  bound  of  each  interval  in  the  set  of  intervals  for 
each  of  the  continuous  parameters  considered. 
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TAEIE_I 


#  of 

Estimated 

Estimated 

Cards 

Execution 

#  of  lines 

Read  In 

Time 
(sec. ) 

Output 

50 

10 

500 

100 

20 

1C0C 

150 

30 

2  COO 

200 

40 

3000 

250 

50 

4000 

300 

60 

5000 

800 

80 

6  CO 0 

1000 

100 

8000 

4000 

150 

10C00 

60000 

240 

420 

600 

900 

3000 

43200 

86400 

15000 
60  COO 
500  C  00 
1CC0C00 

10  16  13 


Est i mated 

#  of 

Amount 

Time  of 

Cards 

of  Core 

E  xecu t ion 

Punched 

Allocated 

<K) 

5 

10 

2  A  .  M  . 

50 

20 

4 

ICO 

40 

6 

150 

60 

8 

200 

,  ec 

9 

300 

ICC 

10 

400 

150 

11 

600 

2CC 

12 

800 

300 

1  P.H. 

1000 

5CC 

2 

1200 

1100 

3 

1500 

4 

2000 

5 

3000 

6 

6CCC0 

7 

8 

9 

10 

11 

12 

15 

11 

20 

Number  of  Classes 

Results 


Eefore  discussing  the  distributions,  cne  relevant 
observation  is  of  importance.  The  bucket  method  used  to 
obtain  our  distributions  is  not  felt  to  affect  cui  results 
significantly.  As  we  have  employed  it,  in  areas  of  a  high 
concentration  of  jobs,  a  larger  number  of  buckets  have  been 
used,  as  opposed  to  the  number  used  in  low  concentration 
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areas.  For  example,  from  0  tc  1  seconds,  we  use  10  buckets; 
from  10  tc  100  seconds,  we  use  11  buckets.  In  this  manner, 
we  can  obtain  a  fairly  accurate  idea  of  the  shape  of  the 
distribution. 

The  distributions  obtained  frcrn  conditioning  CFOT  and 
TFT  on  the  continuous  parameters  and  also  on  the  discrete 
parameters  were  examined  to  determine  which  parameters  would 
provide  good  information  about  the  processing  time 
distribution  of  jobs  before  their  actual  processing.  The 
criteria  of  a  good  parameter  are: 

(1)  Distinct  processing  time  distributions. 

(2)  If  discrete,  the  number  of  classes  is  less  than 

25. 

(3)  No  single  default  is  taken  by  most  of  the  jobs. 

The  following  parameters  were  eliminated: 

(1)  source  (greater  than  25  classes  with  low 
frequency  in  seme). 

(2)  authorization  number  (greater  than  1000  classes 
with  lew  or  zero  frequency  in  some  classes) . 

(3)  time  at  which  a  job  was  passed  to  the  operating 
system  (distributions  are  similar  for  all  intervals  except 
10-11  P.M.  and  11-12  P.M.  which  are  times  used  by  the 
Computing  Centre  staff). 

(4)  estimated  number  of  cards  punched  (less  than  20% 
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of  the  jobs  had  estimates  other  than  default  40C  cards). 
The  parameters  remaining  for  further  investigation 

were: 

(1)  priority. 


(2) 

authorization  code 

letter. 

(3) 

number  of 

cards  read  in. 

(4) 

estimated 

number  of 

lines 

output. 

(5) 

estimated 

execution 

time , 

(6) 

amount  of 

core  allocated. 

and 

(7)  class. 

Best  Partitions 

Conditioning  on  One  Parameter 

The  seven  parameters  considered  as  good  indicators  of 
the  processing  time  distributions  of  jobs  were  nejt  examined 
frcm  the  viewpoint  of  partitioning  the  representative 
distributions  into  distinctly  different  distr ibutions .  This 
involved  measuring  the  differences  in  the  distributions  tc 
determine  which,  if  any,  should  be  combined. 

Of  the  discrete  parameters,  i.e..  Priority, 
Authorization  Code  Letter  (hereafter  referred  to  as 
Authorization  Code),  and  Class,  Priority  was  eliminated  frcm 
further  investigation  and  Authorization  Cede  was  adjusted. 
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Priority  was  dropped  because  of  its  3  classes,  only  two 
distributions  can  be  considered  as  distinct  from  each  ether. 
Priority  4  with  11,000  lots,  emerges  as  one  distribution, 
while  Priorities  0  and  8  with  the  remaining  22,000  lobs 
emerge  as  the  ether.  Figure  3  shows  tie  three  CPDT  and  TFT 
distributions.  (Note  the  remarks  pertaining  to  this  figure 
under  results  helcw) .  Other  parameters,  because  c^  their 
greater  number  cf  partitions,  were  considered  mere 
informative  as  to  the  processing  time  distributions  cf  jobs  * 
entering  the  system. 

Authorization  Code  produces  relatively  distinct 
distributions  of  sufficient  frequency  in  6  cf  its  8  classes, 
i.e.,  in  ccirmerci  a  1 ,  administrative,  graduate, 
undergraduate,  research,  and  Computing  Centre  staff.  Put 
the  classes,  ether-university  and  university-assccia ted, 
have  less  than  . 3 f  of  the  gobs  in  each.  Since  cther- 
universit.y  is  similar  to  graduate,  and  u  n  i  vers  i  t  y-assccia  ted 
is  similar  to  Computing  Centre  staff,  these  classes  were 
cc  rob ined . 

Of  the  continuous  parameters,  i.e..  Number  cf  Cards 
dead  Tj],  Estimated  Number  of  Lines  Output,  Estimated 
Execution  Time,  and  Core  Allocation,  Estimated  Execution 
Time  and  Core  Allocation  were  adjusted.  The  number  cf 
Estimated  Fxccuticn  Timc  classes  was  reduced  frem  15  classes 
tc  3.  The  number  cf  classes  for  Core  Allocation  was  reduced 


sec.  Log  Scale  of  Time  from  .1  seconds  to  70  seconds  70  sec 

FIGURE  3.  CP UT  AND  TRT  CONDITIONED  ON  PRIORITY 
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S  1 


from  11  to  8. 

The  distributions  of  Estimated  Number  cf  Lines  Output 
are  all  distinct  to  seme  degree,  but  intervals  5CC-1000, 
2000-3000,  and  400C-5000  resemble  each  ether  semowhat,  and 
likewise,  intervals  1000-2000,  30CC-4C00,  6CCC-8CC0,  and 
8CC0-10000.  This  parameter  was  net  ccnsidered  in  further 
ccnditioninq  experiments  for  the  fcllcwing  reasons: 

(1)  Its  large  number  of  classes  (13)  means  that  with 
conditioning  on  twe  parameters,  up  to  130  classes  could 
result,  with  very  few  jobs  in  many. 

(2)  A  relationship  between  CPUT  and  the  Estimated 
Number  of  lines  Output  is  not  evident.  Instead,  as 
mentioned  above,  tie  distributions  are  quite  erratic. 

Other  parameters  are  better  in  these  respects,  as  is  evident 
in  later  discussions. 

The  distributions  of  the  Number  of  Cards  Bead  In  prove 
to  be  all  distinct  to  seme  degree.  However,  here  again  the 
definite  relationship  existing  between  CPUT  and  both  Core 
Allocation  and  Estimated  Execution  Time  is  net  as  evident. 
The  greater  number  of  classes  (10)  as  compared  to  the  8  of 
both  Core  Allocation  and  Estimated  Execution  Time  is  a 
deterrent  frem  considerations  in  two  parameter  conditioning. 
The  ten  distributions  tend  to  be  much  more  similar  to  each 
other  than  those  cf  Estimated  Execution  Time  cr  Core 
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Allocation.  Since  it  does  net  perform  as  well  as  the  ether 
continuous  parameters,  it  was  not  considered  for  further 
investigation. 

Results 

The  final  set  of  partitions  for  each  of  the  parameters 
is  given  below.  Any  job  in  a  particular  interval  will  have 
a  value  greater  than  the  lower  bound  and  less  than  or  equal 
to  the  upper  bound. 


TAFLE  II 


#  of  Cards 

Estimated  # 

Fst i mated 

Amount  o r 

Read  In 

of 

E  xe  cut icn 

Core 

lines  Output 

Time 
(sec,  ) 

Al  l  oc  a  ted 
(K) 

0-50 

0-500 

0-10 

0-10 

50-100 

500-1000 

10-20 

10-40 

100-150 

1000-2000 

20-60 

40-6  0 

150-200 

2000-3000 

6C-150 

60-80 

200-  250 

3000-4000 

150-240 

60-1 30 

2  50-300 

4000-5000 

240-420 

1  3  0- 2  0  0 

300-800 

5000-6000 

4 20-90 C 

200-300 

800-1000 

1000-4000 

4000-60000 

600C-80C0 

800  0-10  COO 
10000-15000 

1 500  0-60  CO  0 
60000-50  COCO 
5C0000-1CCC000 

900-43200 

300-1100 

To" 

"11" 

1” 

-- 

Number  of  Classes 
Priority:  4,  Combined  6  and  8 

Class:  A,  P,  C ,  K 

Authorization  Code:  Administration 

Commercial 

Undergraduate 

Research 

Graduate 

Computing  Centre  Staff 

Figures  6  to  14  depict  the  density  function  distributions  of 
parameters  considered  in  further  experiments,  based  on  these 
partitions.  In  examining  these  figures,  the  following 
should  be  kept  in  irind: 

(1)  The  paper  used  is  probability  (ordinate)  by  2  log 
(abscissa)  cycles.  The  leg  abscissa  is  necessary  to  portray 
distributions  relevant  over  a  long  interval  of  time,  viz.,  0 
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tc  2500  seconds,  and  concentrated  in  a  small  portion  cf  that 
interval,  viz.,  0  to  2  seconds,  primarily.  The  probability 
ordinate  is  essential  in  distinguishing  variations  in  the 
lew  proportions  of  jobs  in  the  distribution  tails. 

(2)  Using  this  paper  implies  that  the  zero  value  of 
the  d ist ributions  is  not  represented.  The  ordinate  begins 
with  a  .0001  probability,  and  the  abscissa  with  .1  seconds. 

Figures  4  and  5  give  examples  of  distributions  drawn 
to  a  normal  scale.  The  long  tails  are  not  represented 
because  of  graph  size  considerations. 

Conditioning  on  Two  Parameters 

Conditioning  on  combinations  of  two  of  the  following 
four  parameters  was  performed  to  determine  if  the  resulting 
distributions  were  again  distinct: 


(1) 

Class 

(2) 

Authorization 

Code 

(3) 

Amount  cf 

Core 

Allocated 

(4) 

Estimated 

Exec 

ution  Time 

Results 

The  resulting  distributions  are  shown  in  Figures  15  to 
26.  These  "graphs”  are  intended  to  portray  the  general 
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shape  of  the  distributions,  not  the  details.  In  this 
regard,  the  following  should  be  noted: 

(1)  The  ordinates  vary  from  0  to  15,  to  C  to  100,  and 
are  marked  accordingly.  This  method  is  necessary  to  convey 
the  shapes  of  flat  distributions,  as  well  as  the  shapes  of 
very  peaked  distributions. 

(2)  The  first  two-fifths  of  the  abscissa  is  divided 
equally  among  the  intervals,  C-.l  seconds,  .1-.2 
seconds ,...,. 9-1 . C  seconds.  The  remaining  three-fifths  of 
the  abscissa  is  divided  equally  among  the  intervals,  1-4 
seconds,  4-7  seconds,  and  7-10  seconds.  This  is  necessary 
to  convey  the  nature  of  the  peaks. 

(3)  The  mean  (in  seconds) ,  standard  deviation  (in 
seconds),  coefficient  of  variation,  and  frequency  are  given 
above  the  graph,  for  that  particular  graph.  On  the  graph 
itself,  appear  the  percentage  of  jobs  lying  in  the  interval 
ccntaining  the  maximum  point  of  the  distr ibuticn,  and  the 
maximum  pcint  value  in  relation  tc  CPUT  or  TFT  (in  seconds). 

(4)  Occasionally,  a  box  will  appear  blank,  with  only 
the  frequency  provided  abcve  it.  This  class  contains  too 
few  and  toe  scattered  a  number  of  jebs,  tc  provide  a 
meaningful  distribution.  If  a  distribution  dees  appear  with 
only  the  frequency  above  it,  then  the  class  contains  too  few 
jobs  to  provide  meaningful  statistics,  tut  those  few  are 
concentrated  in  one  area. 
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Seme  interesting  cbfprvaticns  can  he  made  ccncccnia'i 
the  empirical  distributions.  Generally,  their  shapes  are 
gamma-like  with  extremely  long  tails.  An  occasional 
distribution  resembles  a  hy perexpenent ial  cr  exponential 
distribution.  The  TFT  distributions  are  similar  to  their 
ccr responding  CFTJT  distributions  in  that  if  ere  peaks,  the 
ether  does  also,  but  the  peaks  of  the  tft  distributions  are 
much  flatter  and  are  displaced  linearly  tc  the  right. 

Particular  ebservatiens  cf  the  distributions  are 
provided  below,  alcnq  with  insights  where  relevant,  provide 
by  knowledge  cf  the  JW/310  operating  system. 

Zero_Par  amete  r_Co  ndi  tic  nine]  (figure  6 ) 

(1)  The  TFT  distribution  shews  a  greater  frequency 
in  the  first  0  to  .1  second  interval  than  in  the  following 
one  or  two  intervals.  This  may  be  attributable  tc  jobs 
completing  a  compilation  phase  only,  i.e.,  jobs  that  do  net 
execute,  probably  due  to  syntax  errors. 

(2)  As  is  expected,  the  CP  FT  distribution  has  a 
greater  proportion  of  jobs  in  the  beginning  intervals  than 
TFT  does  (0  to  1  seconds  accounts  for  353  of  the  jobs  in  th 
CFTJT  distribution,  and  only  life  cf  the  jobs  in  the  TFT 
distribution).  It  also  has  fewer  jobs  in  the  tail  interval 


. 


■ 


(]0  t. o  2500  seconds  accounts  for  16$  cf  the  jots  in  the  CFUT 
distribution  and  of  the  jobs  in  the  T  distribution). 

(3)  The  coefficients  of  variation  of  the  CFUT  and  79? 
distributions  are  5.9  and  3.7,  respectively. 

One  Parameter  Conditioning 

(1)  C^ass  (Figures  7  and  8) 

(a)  CFUT  and  TPT  distributions  are  distinct  t- a 
seme  degree,  but  generally  do  not  display  much  variation, 
although  TPT  d  ist r  ibut ions  are  better  in  this  re  gar  1  thin 
CFUT  distributions.  Class  A  jets  of  the  TRT  distributions 
displayed  the  characteristic  described  in  Zer c  Parameter 
(1)  -  The  explanation  there  is  supported  by  tie  reasoning 
that  Class  B  and  C  jobs  probably  debug  syntax  errors  in 
Class  A  since  it  provides  faster  turnaround  time  and  is 
cheaper. 

( b )  Class  M  has  the  most  distinctive 
distribution.  Since  this  is  the  Computing  Centre  staff 
class,  many  shert  programs  related  directly  tc  the  function 
of  the  Centre  are  run,  and  many  of  these  jehs  run  repeatedly 
throughout  a  day. 

(c)  All  classes  except  M  displayed  long  tails. 
TFT  tails  were  more  full  than  CPU!  tails. 


(d)  The  jobs  were  not  distributed  evenly  ever 
the  classes.  Class  A  had  mere  than  75$  of  the  jots.  Clas 
E ,  C  and  M  had  approximately  10$,  55  51  #  and  10$,  respective! 

(e)  The  coefficients  cf  variaticn  i»rrr 
significantly  less  than  the  coefficients  cf  variation  for 
the  unconditioned  distributions  in  75°?  of  all  the  classes. 

(2)  ijUthorj.2ati_cn_Code  (Figures  0  and  10) 

(a)  CEl’T  distributions  are  distinct  but  dc  not 
display  much  variation  among  classes,  with  the  exception  c 
the  Computing  Centre  Staff  class.  Again,  TFT  distributions 
are  more  dissimilar  from  each  other  than  the  CPU 
distribut ions,  reflecting  the  varying  I/O  demands  of  the 
different  groups  cf  users. 

(b)  The  number  of  jobs  in  each  of  the 
Undergraduate,  Research,  and  Computing  Centre  staff  classe 
is  approximately  the  same  and  in  all  account  ter  more  than 
75$  of  the  jobs.  The  Graduate,  Administration,  and 
Commercial  classes  have  close  tc  11$,  5.5$,  and  3.5$, 
respectively . 

(c)  The  coefficients  cf  variaticn  for  the  CPU’7' 
and  TRT  distributions  are  significantly  less  than  those  fo 
the  unconditioned  distributions  in  67$  and  P3$  cf  the 
classes,  respectively. 
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(3)  Estimated  Fxec ut ic n_Ti me  (Figures  11  an.]  12) 

( a )  P  o  t  h  C  P  (J  T  and  T  P  T  distributions  show 
distinct  distributions  in  all  eight  classes.  Foth  shew  i 
gcod  degree  of  variation  in  the  shapes  cf  the  distributions 

(b)  As  is  expected,  there  is  positive 
correlation  between  increasing  processing  times  and 
increasing  estimated  execution  times.  For  example,  the 
means  and  maxim urns  ot  the  CFUT  distributions  rise  from  1 .  >• 
seconds  and  .16  seconds,  respectively,  for  the  0- 1 n  second 
estimated  executier  time  interval,  to  l1":.?1:  sec  or,  is  anl  3.  1 
seconds,  respectively,  for  a  900-432000  second  interval. 
Those  of  the  TFT  d  ist r ihu t i cn s  rise  from  12.3  seconds  and 
.76  seconds,  respectively,  for  a  C-10  second  interval  to 

6  3.24  seconds  and  4 . 2  c;  seconds,  res  pec  t  i  ve  1  y  ,  for  a  900- 
43200  second  interval. 

(c)  The  classes  have  relatively  equal  numbers  o 
jobs  in  each. 

(d)  A  peculiarity  of  this  system  can  he  seen: 
when  the  estimated  execution  time  is  egual  tc  actual  CTO 
time  plus  I/O  time,  the  job  should  he  terminated.  In 
theory,  there  should  be  no  TFT  value  greater  than  the  upper 
limit  of  the  estimated  execution  time  interval.  However, 
the  system  does  not  always  check  for  this  set  cf  events. 

(e)  Duly  in  50^  of  the  classes  cf  both  CFUT  and 
TFT  is  the  coefficient  of  variation  less  than  that  of  the 
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relevant  unconditioned  dist r ihu t icn •  The  decreasing 
coefficients  of  variation  with  increasing  estimated 
executior  times,  imply  that,  for  longer  jots,  this  parameter 
is  a  better  indicator  of  service  times. 

(4)  (Figures  13  and  14) 

(a)  As  in  the  Ksti mated  Fxecuticn  Time  results-, 
CEO?  and  TRT  distributions  conditioned  on  Core  Allocation 

> 

are  distinct  and  show  a  good  degree  of  variation. 

(b)  The  means,  the  right. war d  movement  of  the 
peaks,  and  the  proportions  in  the  tails  generally 
demonstrate  positive  correlation  with  increasing  amounts  cf 
core  allocation.  Feans  and  maximums  for  CFOT  distributions 
rise  from  .06  and  .06  seconds,  respectively,  for  a  0-1  OK 
cere  interval  tc  .66  and  1.3  seconds,  respectively,  for  a 
300- HOOK  interval.  Those  for  TRT  distributions  rise  from 
.10  and  .05  seconds,  respectively,  for  a  C-10K  interval  to 

9  7.8  an  d  ? .  °  sec-onus,  respectively,  for  a  3  00- 110  OK 
interval.  flov^rrs  ;  t  )■■■■  relationship  for  the  core  allocation 
parameter  is  net  as  consistent  in  each  of  the  core 
allocation  intervals,  as  it  is  for  the  estimated  execution 
time  parameter  in  each  of  its  intervals.  Put  it  is  also 
more  consistent  in  THT  distributions  than  in  CFCT 
1 istr ibu t ion s .  This  may  be  due  to  the  effect  of  the 
operating  system  cn  core  allocation,  i.e.,  the  default  core 


allocation  assigned  to  a  jot  when  none  is  specified  by  the 
user.  Tho  effect  would  he  lessened  in  TUT  distributions 
because  of  the  dispersion  caused  by  additional  I/C  time. 

(c)  The  jobs  are  not  distributed  evenly  over  the 
core  allocation  classes.  Intervals  of  P0-130K  and  13C~?')CF' 
each  have  close  to  335  of  the  jobs.  0-10K  and  30C-11Q0K 
each  have  cnly  15  of  the  jobs.  The  remainder  { 3  ?  ^ )  is 
distributed  relatively  evenly  over  the  four  remaining 
intervals. 

(d)  The  0-10 K  interval  displays  a  very  high— 
peaked,  short- tai.  led ,  non-increasing  distribution.  The  dO- 
130K  interval  for  CPUT  also  displays  a  non-increasing 
distribution,  but  with  a  long  tail.  There  are  no  known 
reasons  for  these  anomalies. 

( e )  In  less  than  h 0 %  c f  the  c 1  a s s e s  is  the 
coefficient  of  variation  signif icantly  less  than  tnose  of 
the  unconditioned  distributions. 

to  l  _Co  n  d  j_  t  i  on  i  ng 

For  ease  of  discussion,  particular  distributions  will 
be  referred  to  as  CPTJT/P1/P2  or  TRT/P1/P2  where  PI  and  P2 
arc  either  conditioning  parameters  or  specific  classes  of 
the  conditioning  parameters. 
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(1)  Class  ami  Estimated  Execution  Time  (Figures  IS 

and  16) 

(a)  Eistribut ions  are  distinct  tc  seme  degree 
fer  most  CPTJT  classes.  TRT  distributions  ace  less 
d ist i net-- t he  following  intervals  could  be  combined: 
TrT/R/150-240sec.  and  TRT/B/ 150- 240sec • ;  T Rl/^/4 2 C - 00 Csec . 
and  TRT/A/1900-43200sec.  ;  TPT/C/2  4  0-4  2  Ose  c  .  and  TET/C/420- 
9C0sec.  The  sets  of  distributions  in  Classes  A  an d  ? 
generally  do  not  show  much  variation  from  each  ether  for 
either  of  CPHT  or  TRT.  Class  C  distributions  have  generally 
flatter  peaks  than  those  of  Classes  A  and  P,  fer  CPU?;  tr^ 
distributions  are  all  very  flat.  Class  M  distributions  are 
very  distinct  from  those  of  ether  classes. 

(h)  The  relationship  between  Estinated  Execution 
Time  and  both  of  CEUT  and  TFT,  evident  in  cne  parameter 
conditioning  is  also  evident  in  this  t w c  parameter 
cond i tioning. 

(c)  The  percentage  cf  jobs  in  each  set  of 

classes  fer  the  Class  parameter  is  distributed  fairly  evenlv 
over  the  eight  execution  time  classes,  with  the  exception  of 
Class  Here,  the  first  interval,  0-10  seconds,  accounts 

for  0  ?.%  of  the  jobs  in  Class  M  and  virtually  nc  jobs  have 
estimated  execution  times  cf  greater  than  150  seconds. 

(d)  Tn  greater  than  R0*  of  the  classes  fer  both 
C FrJT  and  TRT,  the  coefficient  cf  variaticn  is  significantly 


less  than  the  coefficient  of  variation  of  the  relevant 
unconditioned  distribution. 

(e)  6  cut  of  the  32  possible  classes  in  both 

CFUT  and  TRT  conditioning  contain  an  insufficient  number  or 
jobs. 

(2)  Cl^f  Allocation  (Figures  17  and  18) 

(a)  The  distributions  are  distinct  for  all  CPUT 
classes.  TRT  distribu t ions  have  scire  similarities-- 

T  FT/B/13  0-200  K  and  TFT/P/  200-  300K ;  TRT/C/ 200- 30CK  and 
TET/C/300-1100K  can  be  combined.  Unlike 
CF  DT/Class/Es  t  ima  t  ed  Execution  Time,  the  sets  of  CPIIT 
distributions  for  Classes  A,  E,  C  and  1  all  shew  significant 
variation  from  each  other.  TRT  distributions  show 
significant  variation  in  lower  core  classes  tut  classes 
involving  core  allocation  greater  than  130K  were  quite 
similar  for  Classes  A,  E,  and  C. 

(b)  The  relationship  te tween  core  allocation  and 
CFUT  and  TRT  is  evident  here  in  each  set  of  distributions 

for  Classes  A,  E,  C,  and  M. 

(c)  As  is  expected  from  relevant  observations  in 
one  parameter  co nditioning,  the  percentage  cf  jobs  is 
distributed  unevenly  over  all  of  the  classes. 
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(d)  Greater  than  65s?  cf  the  classes  fcr  both 
CP UT  and  TET  have  coefficients  of  variation  significantly 
less  that  that  of  the  relevant  unconditioned  distribution. 

(e)  6  cut  of  the  32  possible  classes  in  both 
CPUT  and  TFT  conditioning  contain  an  insufficient  number  cf 
jobs. 

(f)  The  non -increasing  distribution  of  the  core 
allocation  class  fl-lOK,  noted  previously,  recurs  in  all 
classes  of  CPM/Class/O-ICK ,  but  not  in  every  class  of 

T ET/C lass/ 0-10 K  because  of  the  effect  of  I/C  time.  The  non- 
increasing  distribution  of  a  core  allocation  class  of  90- 
1 3 OK ,  appears  for  CPUT/C/90- 1 30K  and  CPUl/ W/80-13CK. 

However,  Class  M  is  primarily  responsible  fcr  the  anomaly 
since  it  contains  93E-  of  the  jobs  causing  the  anomaly. 

(3)  Class  and  Authorization  Code  (Figures  19  and  20) 

(a)  CF  DT  distributions  are  distinct  fcr  all 
classes,  but  the  shapes  fcr  Classes  A,  E,  and  C  ate 
generally  more  similar  to  each  other  that,  are  those  of 
CPUT/Class/Fstimated  Execution  Time  or  CF tl/Class/Core 
allocation.  TET  distributions  are  distinct  tc  seme  degree, 
but  the  distributions  for  Classes  A,  E,  and  C  are  all  very 
flat  and  vary  much  less  than  the  corresponding  CPUT 
distributions  and  TRT/Class/Estiraated  Execution  Time  or 
T ET/C 1 ass/Core  Allocation.  Class  M  distributions  are  quite 
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similar  to  each  ether  for  3  of  the  4  classes  represented. 
This  is  an  expected  characteristic  of  Class  M,  as  it  is 
reserved  for  emergency  jobs  usually  involving  seme  repair  of 
the  system. 

(b)  As  is  expected  frem  previous  c b servat i ens , 
the  distribution  cf  the  percentage  cf  jobs  in  each  class  is 
poor. 

(c)  Only  2  classes  out  of  the  possible  24  for 
beth  CPUT  and  TRT  have  an  insufficient  number  cf  jots. 

(d)  Greater  than  7  C  %  and  307  of  the  coefficients 
of  variation  of  the  CPUT  and  TFT  distributions, 
respectively,  are  significantly  less  than  that  of  the 
relevant  unconditioned  distribution. 

(4 )  Authorization  Code  and  Fstiraated  Execution  Time 
{Figures  21  and  22) 

(a)  All  distributions  of  CPUT  are  distinct  bu+ 
dc  not  vary  in  the  height  of  the  peaks  as  greatly  as 
CPUT/Estimated  Execution  Time/Core  Allocation.  The 
distributions  cf  TFT  are  somewhat  distinct — 

TFT/fJndergrad  uate/420-  900sec.  and  TFT/Research/420-90Csec . 
can  be  combined — but  are  usually  very  flat  for  any  estimated 
execution  time  greater  than  6C  seconds. 

(fc)  The  relationship  between  CPUT  and  Estimated 
Execution  Time  as  discussed  in  (3)  { b )  of  One  Parameter 


Conditioning,  is  again  evident  here  foe  all  classes  of 
Authorization  Code,  hut  that  between  TRT  and  Estimated 
Execution  Time  is  less  so. 

(c)  The  distribution  of  the  percentage  of  jobs 
in  each  class  is  somewhat  uneven. 

( d )  Greater  than  15$  of  the  C  £  \)  1  and  TRT 
distributions  had  coefficients  of  variation  less  than  that 
of  the  correspc rdi ng  unconditioned  distribution. 

(e)  41  out  of  the  possible  48  classes  are  of 
sufficient  frequency  to  be  considered. 

(5 )  Authorization  Code  and  Core  Allocation  (Figures 
23  and  24) 

(a)  The  CPIJT  distributions  are  all  distinct  and 
show  more  variation  in  the  shapes  than  do  these  for 
CEUT/Author ization  Code/E st irca ted  Execution  Time,  but  less 
than  those  for  CP bl /Core  Allocation /Estimated  Execution 
Time.  Most  distributions  for  TRT  are  distinct-- 
TRT/Administraticn/130-200sec.  and  TRT/Cc mmerc ial/1 30-200 
can  be  combined — and  the  same  can  be  said  for  variation  as 
is  said  for  CPUT  distributions. 

(b)  'the  relationships  between  CUT,  TRT  and  Coro 
Allocation  mentioned  in  (4) (b)  of  Cne  Parameter 
Conditioning,  are  evident  here  again. 
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(c)  The  percentage  of  jobs  in  each  class  i s 
quite  uneven. 

{ d )  Greater  than  6  5^  of  the  classes  fcr  both 
CFUT  and  TFT  have  coefficients  of  variation  great er  *han 
that  of  the  relevant  unconditioned  distribution. 

(e)  1C  of  the  possible  48  classes  fcr  both  C PUT 
and  TFT  contain  an  insufficient  number  cf  jobs. 

(6)  Core_A llocation_and_Fs t  i  mated_  Execu  t ion_Tj_me 
{Figures  2C  and  26) 

(a)  CEHT  distributions  are  all  distinct  and  vary 
greatly  fcr  all  classes  with  frequency  greater  than  60. 

d istr ibut i cns  are  less  distinct,  particularly  fcr  classes 
with  an  estimated  execution  time  greater  than  160  sec ends- - 
TFT/200- 300K/240-4 20sec  .  and  TFT/13C-200K/240-420  sec .  ; 

TFT/ 2C  0-30  OK/4  2  C-9  00sec .  and  TFT/ 2 C C- 30  OK/ 90 C-4 3 2 OOscc .  can 
be  combined.  The  distributions  are  usually  very  flat  be yen! 
this  point. 

(b)  The  relationships  between  CPUT,  TFT,  Core 
Allocation,  and  Estimated  Fxecuticn  Time  discussed  in  (3)  (b) 
and  (4) (b)  of  One  Parameter  Conditioning,  are  evident  here 
again . 

(c)  The  classes  have  relatively  urequal 


percentages  of  gobs. 
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(d)  80‘S  and  6CTc  of  the  CPUT  and  TFT 
distributions,  respectively,  have  coefficients  cf  variation 
less  than  that  of  the  relevant  uncc nditioned  distribution. 

(e)  IF  out  of  the  possible  64  classes  for  both 
CPUT  and  TPT  contain  an  insufficient  number  cf  jobs. 


Distribution  Fitting 

Further  processing  cf  the  data  involved  attempts  to 
fit  the  empirical  distributions  to  theoretical 
distributions.  Because  o^  the  previous  work  in  the  area  c f 
scheduling  and  service  time  d ist r ibu t iens  ,  and  because  of 
the  general  shape  cf  the  empirical  distributions,  the 
theoretical  distributions  considered  were: 

(1)  exponential, 

(2)  hyperexponential, 

(3)  gamma, 

(4)  Weibull, 

(5)  log-normal, 

(6)  Peta,  and 

(7)  convolution  of  other  distributions. 

We  will  discuss  the  treatment  and  results  cf  each. 

(1)  Fxpcnent  ia  1 


The  exponential  density  distribution  was  produced  in 
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the  main  proq  r  am  described  in  Chapter  T  T ,  a  lcn  g  with  the1 
chi-square  value. 

Results 

The  expcnential  distribution  is,  generally,  far  from 
satisfactory.  Only  in  very  isolated  cases,  does  the 
empirical  distribution  have  an  expcnential- like  shape. 
Figure  27  illustrates  the  difference  between  the  empirical 
and  the  exponential  distribution,  in  the  best  cf  these 
isolated  cases. 

(2)  a  11  3  (3)  Gamma 

Here  also,  the  theoretical  density  distributions  viplo 
produced  in  the  main  program,  with  chi-square  values. 

Tables  reflecting  cumulative  prct ab i 1 it ies  for  various 
values  of  the  parameters  were  also  produced. 

Results 

Again,  the  theoretical  density  distributions  are  very 
different  from  the  empirical  distributions.  Figure  28 
depicts  this  difference  in  one  of  the  better  fitting 
distributions.  The  tables  do  not  prove  to  be  any  more 
satisfactory.  Upon  comparing  the  cumulative  probabilities 
cf  the  theoretical  versus  the  empirical,  we  find  that  two 


conditions  exist: 
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(1)  Matching  of  the  peaks  produce  very  poor  results 
in  the  tail  area.  In  fact,  tie  theoretical  distributions 
lose  their  tails  very  quickly  as  compared  to  the  extended 
tails  of  the  empirical  distributions. 

( d )  Matching  of  the  tails  p  r  c  1  u  c  e  very  poor  results 
in  the  peak  area.  The  proportion  of  jots  in  the  peak  of  t 
theoretical  distribution  is  insignificant  when  compared  to 
the  proportion  of  jobs  in  the  peak  of  the  empirical 
distribution. 

Results  for  the  gamma  distribution  arc  similar  tor 
estimation  of  its  parameters  by  both  the  mean  and  standard 
deviation,  and  the  mean  and  maximum. 


(U) 


In  separate  programs,  tables  of  cumulative 
probabilities  were  generated  for  various  values  of  the 
parameters  of  the  distribution,  and  theoretical 
distributions  were  produced  based  on  the  regression  method 
of  estimating  Weibull  parameters. 

ng  suits 

Tie  fits  here  are  very  much  like  these  ct  the  gamma 
listi'i  ration.  Through  examination  of  the  tables,  ai  d  with 
composite  of  Weibull  distributions: 
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where  aa  =  0  and  ann  +  1  - 


a  poor  fit  was  obtained,  which  is  the  best  thus  far.  Figure 
29  gives  an  idea  cf  the  fit  resulting  frem  a  composite  of  5 
distributions. 


(5)  L  r  jra  1 

Little  was  dene  with  this  distribution.  This  is  an 
area  for  further  investigation;  the  tail  may  he  thicker  than 
that  of  either  the  Weibull  or  qamma. 

(6)  Beta 

This  distribution  was  considered  but  no  research  with 
actual  data  was  performed.  This  is  a  candidate  for  future 
investigation  because: 

(a)  its  general  shape  seems  to  have  a  mete  rapid  drop 
after  the  maximum  value  than  the  gamma  distribution,  and 
consequently,  perhaps  a  mere  extended  tail.  See  Figure  31. 

(b)  it  is  finite,  which  is  a  charac teristic  of 
empirical  d istr ibu tions. 


(7)  Convc  lu  t  icn_cf_0  ther_DistribuHgns 


CM 


03 

a 

o 

*H 

4J 

P 

4H  ,£3 
O  *H 
3-i 

a)  -i-i 

■M  03 
•H  *rH 
03  Q 
O 

PH  pH 

6  rH 
O  P 
U  ^ 
•H 
0) 
•S' 


03 

P 

O 

•H 

4H 

P 

-Q 

•H 

4-1 

03 

p 


p 

-a 

•H 

0) 


« — i 

a3 

< 

> 

U 

1 - 1 

o 

25 

Q) 

+ 

H 

o 

4-t 

•H 

o 

o 

o 

M 

P 

03 

• 

• 

• 

• 

H 

H 

M 

o 

' — 1 

o 

M 

P 

rH 

Ph 

pq 

Q3 

M 

6 

Pd 

* 

•H 

•H 

o 

o 

o 

o 

o 

H 

H 

03 

• 

• 

• 

• 

• 

M 

c/3 

CM 

o 

o 

rH 

M0 

o 

H 

w 

1 — 1 

P3 

p 

W 

PQ 

Pi 

M 

p 

P 

Pd 

O 

H 

CJ 

M 

CO 

hH 

P 

M 

M 

Pi 

C3 

o 

00 

M0 

o 

o 

P 

M 

4-t 

03 

pq 

' 

PH 

cu 

aj 

m 

CO 

in 

>n 

cn 

t-1 

S 

6 

P 

pJ 

w 

o3 

pH 

P 

P 

o3 

PQ 

03 

> 

o 

M0 

CM 

o 

o 

M 

PH 

*H 

W 

< 

r— 1 

rH 

rH 

CO 

<r 

is 

H  CM  CO  M  03 


CT> 


00 


CT> 


CO 


Time  (seconds)  FIGURE  29. 


72 


It  is  popsihle  that  the  empirical  cl  i s 1 1  i  k u t  i o n s  are 
mixtures  cf  ether  distributions,  such  is 

(a)  convolution  of  an  exponential  d  i  s  t  r  i  hu  +  i  ■>!  and  \ 
hyperexpenont ial  distribution,  and 

(h)  convolution  cf  an  Frlang-k  list ri  hut  ion  u.  1  i 
h y  no roxpenential  d i s t  r  i  h  u  t i c  n . 

ho  research  was  performed  lr  this  area. 

Distribution  Char act er i  za  t ions 

The  mean  residual  life  and  rark  values  were  calculate 
f  c  r  most  of  the  empirical  distributions. 

Results 

hot h  the  mean  residual  life  and  rank  are  generally 
increasinc  with  seme  intervals  cf  decreasin'!  values.  The 
values  for  mean  residual  life  arc  consistently  and 
significantly  higher  than  those  for  rark,  at  coi respon  line 
points  of  the  distributions.  Figures  Id  and  31  give  tyri<  t 
a  r:  1  unusual  results  for  each  function. 
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Conclusions 


Eest  Partitioning  Parameters 

Our  results  indicate  that  certain  parameters,  alcne 
and  when  included  with  another,  partition  CPU  times  and 
total  run  times  into  more  meaningful  distributions  than  1o 
others.  To  determine  which  parameters  yield  the  best 
results,  we  consider  the  following: 

(1)  The  amount  of  variation  frem  cne  distribution  to 
another.  This  is  the  most  important  criterion.  Cne  factor 
in  determining  the  amount  of  variation  is  the  number  cf 
classes  with  a  distribution  indistinguishable  frem  that  of 
seme  other  class.  This  is  relevant  only  in  the  two 
conditioning  parameter  case. 

(2)  The  percentage  of  gobs  comprising  each 
distribution.  Discrete  parameter  classes  are  formed  through 
predetermined  criteria.  For  example.  Class  is  specified  by 
A,  E,  C,  and  K.  No  other  adjustment  is  made  tc  the  classes 
other  than  combining  them  if  indistinguishable  distributions 
do  occur.  Consequently,  if  a  very  large  percentage  of  the 
jebs  fall  within  a  particular  class,  the  class  has  mere 
possibility  of  being  a  joint  distribution  cf  ether  submerged 
distributions.  With  continuous  parameters,  this  factor  is 


not  important  due  to  the  amount  cf  interval  investigation 
performed  in  choosing  class  boundaries.  This  factor  is  of 
much  less  importance  under  two  parameter  conditioning, 
particularly  so  when  a  discrete  parameter  is  combined  with  a 
continuous  parameter. 

(3)  The  number  of  classes  with  sufficient  frequency. 
The  greater  the  number  cf  classes  cf  a  parameter  implies  th<> 
greater  the  number  of  distinct  d ist r i b u t io r s  in  one 
parameter  conditioning.  However,  in  two  parameter 
conditioning,  a  number  of  the  resulting  classes  may  have  low 
frequency,  and  thus  cannot  be  considered  as  a  separate 
distribution. 

In  the  following  discussions,  parameter  references 
apply  to  both  CPU  time  and  total  run  t  i  :;»e  distributions, 
unless  otherwise  specified. 

Resu  1 1  s_c f  _One_Cc  nd i  t  i on  i.  rq_Pa r ame  te r 

The  amount  of  variation  differs  from  parameter  to 
parameter.  Our  results  indicate  that  Core  Allocation 
produces  the  greatest  variation,  with  Estimated  Execution 
Time  performing  almost  as  well.  Class  and  Author izat ion 
Code  ate  much  less  significant  in  this  respect,  with  Class 
probably  the  better  of  the  two.  We  conjecture  that  the 
dependence  of  Core  Allocation  or  operating  system  procedures 
may  be  responsible  in  part  for  the  greater  variation  among 
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distributions  for  that  parameter  than  among  the 
distributions  for  Estimated  Execution  Time. 

When  considering  the  evenness  of  the  distribution  of 
jobs  over  the  classes,  we  find  that  generally,  the 
continuous  parameters  tend  to  te  tetter  than  the  discrete 
parameters.  Estimated  Execution  Time  definitely  has  the 
most  uniform  distribution  and  Class,  the  least  uniform 
distribution. 

All  classes  for  each  parameter  have  sufficient 
f  reguency . 

Pesul ts_gf_Twg_Condi t ion ing_ Par a  meters 

Because  of  the  large  number  of  classes  in  each  two 
parameter  set,  it  is  more  difficult  tc  determine  which 
combinations  of  parameters  produce  the  greater  variation  in 
their  distributions.  Generally,  the  distributions  resulting 
frcm  CPU  time  conditioning  are  much  mere  variable  than  those 
resulting  frcm  total  run  time  con d i t ic n ing .  Core  Allocation 
and  Estimated  Execution  ^ime  partition  CPU  time  letter  than 
other  combinations.  However,  total  run  time  is  best 
partitioned  by  Core  Allocation  and  Authorization  Code, 
implying  that  the  type  of  user  is  of  importance  when 
considering  I/O  times.  Class  combined  with  Authorization 
Cede  produces  the  poorest  distribution.  We  ncte  that  Core 
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Allocation  combined  with  Class  or  Authorization  Cede 
conditions  significantly  tetter  than  dees  Estimated 
Execution  Time  with  Class  or  Authorization  Code. 

Upon  considering  which  parameters  produce  classes  of 
equal  frequency,  we  find  that  continuous  parameters  in 
combination  with  discrete  parameters  have  mere  even 
distributions  than  the  combination  of  the  discrete 
parameters. 

Once  classes  with  insufficient  rumters  c £  jobs  in  them 
are  eliminated,  the  number  of  classes  for  the  combinations 
of  Estimated  Execution  Time  and  Authorization  Cede,  and 
Estimated  Execution  Time  and  Core  Allocation,  are 
approximately  the  same.  Authorization  Cede  and  Core 
Allocation  have  fewer  classes.  Class  in  cc mb i n at  ion  with 
any  of  the  other  parameters,  has  roughly  the  same  number  of 
classes. 

^r  cm  our  previous  discussions,  we  conclude  that  the 
best  conditioning  parameter  is  Core  Allocation,  with 
Estimated  Execution  Time  and  Authorization  Code  performing 
almost  as  well.  The  poorest  is  Class.  We  also  conclude 
that  two  parameter  conditioning  yields  tetter  results  than 
one  parameter  conditioning. 
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One  other  result  of  partitioning  is  worth  noting  here, 
that  pertaining  to  the  reduction  of  coefficients  in 
variation.  In  all  conditioning  results,  the  coefficients  of 
variation  are  reduced  from  these  of  the  unconditioned 
distributions  in  a  significant  nuirfcer  of  classes.  Very  few 
were  noticeably  higher.  Thus  we  can  also  conclude  that 
partitioning  dees  lower  the  coefficient  of  variation 
significantly  in  the  majority  cf  the  partitioned 
d istr ibu  t ions. 

These  conclusions  are  relevant  only  to  cur  operating 
system  and  the  time  period  over  November  and  December  of 
1972.  Tn  another  system  or  within  another  time  period, 
different  results  may  be  obtained.  Additional  parameters 
could  and,  if  possible,  should,  be  considered  in  future 
studies.  One  that  may  produce  interesting  results  is  a 
parameter  indicating  the  translator  required  for  the 
program.  Another  involves  more  refinement  in  the  type  of 
user  area,  i.e.,  relevant  to  our  data,  more  distinction  as 
to  the  type  of  program  submitted  in  each  cf  the  classes  of 
Authorization  Cede.  Perhaps  estimates  of  the  actual  service 
times  rather  than  the  upper  limits  of  the  service  times, 
would  prove  meaningful. 


Distribution  Fit 
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The  empirical  distributions  dc  not  fit  satisfactorily 
to  any  of  the  theoretical  distributions  based  on  our  methods 
of  parameter  estimation.  Very  peaked  distributions  have 
little  breadth;  most  distributions  have  too  large  a 
proportion  of  jobs  lying  in  the  tails.  We  feel  that 
research  involving  different  methods  of  parameter  estimation 
of  the  theoretical  di st r ibu ticns ,  may  be  productive. 
Investigation  of  the  log-normal,  Eeta,  and  ccnvcluticn  of 
distributions  would  be  worthwhile.  One  alternative  approach 
to  this  problem  is  to  fit  the  functions  that  characterize 
the  empirical  dist r ibutions,  such  as  failure  rate,  mean 
residual  life,  or  rank. 


Implications  in  the  Scheduling  Discipline  Area 

Our  results  do  imply  that  a  priori  characteristics  cf 
a  job  can  be  used  to  acguire  information  about  the 
processing  time  distribution  of  that  job  and  lower  the 
coefficient  of  variation  cf  the  distribution  significantly. 
Partitioning  on  more  than  one  parameter  is  mere  significant 
in  a  multiprogramming  environment  than  in  a  single  task 
environment,  where  the  effects  of  varying  I/C  requirements 
are  felt.  These  results  do  have  a  direct  effect  on  those 
scheduling  disciplines  that  use  this  information  in  their 
selection  rule,  i.e.  SEFT,  SEEPT,  and  SF. 
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To  effectively  use  this  information,  these  disciplines 
must  store  varying  amounts  of  inf cr ma ticn .  SFFFT  and  SP 
must  store  the  greatest  amount,  as  they  depend  on  preemption 
and  thus  information  gained  during  the  processing  of  a  job:. 
But  although  these  rules  may  perform  tetter  than  others  with 
regard  to  response  times  and  ether  considerations, 
preemption  and  storage  costs  may  decrease  si gn  i  f ican 1 1 y  the 
overall  benefit  of  using  these  scheduling  disciplines. 

Before  the  full  significance  of  our  results  can  be 
determined,  further  research  must  be  directed  at  studying 
the  effects  of  these  costs,  through  simulation,  through 
actual  system  investigation,  or  through  further  analysis. 
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